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1 GACGGATC GG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC 

CTGCCTAGCC CTCTAGAGGG CTAGGGGATA CCAGCTGAGA GTCATGTTAG 
51 TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 

AC GAG AC T AC GGCGTATCAA TTCGGTCATA GACGAGGGAC GAACACACAA 
101 GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG 

CCTCCAGCGA CTCATCACGC GCTCGTTTTA AATTCGATGT TGTTCCGTTC 
151 GCTTGACCGA CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG 

CGAACTGGCT GTTAACGTAC TTCTTAGACG AATCCCAATC CGCAAAACGC 
201 CTGCTTCGCG ATGTACGGGC C AG AT AT AC G CGTTGACATT GATTATTGAC 

GACGAAGCGC TACATGCCCG GTCTATATGC GCAACTGTAA CTAATAACTG 
251 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 

ATCAATAATT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 
3 01 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG 

ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 
3 51 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 

GGGTTGCTGG GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA 
401 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT 

TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTG ATAAATGC C A 
451 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 

TTTGACGGGT GAACCGTCAT GTAGTTCACA T AGTAT AC GG TTCATGCGGG 
501 CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA 

GGATAACTGC AGTTACTGCC ATTTAC CGGG CGGACCGTAA TACGGGTCAT 
551 CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 

GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT 
601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 

AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCGCACCT 
651 TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 

ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 
701 TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 

ACCCTCAAAC AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT 
751 ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 

TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 
801 GTC TATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA CTGCTTACTG 

CAGATATATT CGTCTCGAGA GACCGATTGA TCTCTTGGGT GACGAATGAC 
851 GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTTGGTACC 
CGAATAGCTT TAATTATGCT GAGTGATATC CCTCTGGGTT CGAACCATGG 
+3 M E T D T L L LW 

901 GAGCTCGGAT CGATATCGCC AC C ATGG AG A CAGACACACT CCTGCTATGG 
CTCGAGCCTA GCTATAGCGG TGGTACCTCT GTCTGTGTGA GG AC GAT AC C 

+3 V . L . L . L V V V , P , G , S T G , D A p D R s- 

951 GTACTGCTGC TCTGGGTTCC AGGTTCCACT GGTGACGCTC CGGACAGGAG 
CATGACGACG AGACCCAAGG TCCAAGGTGA CCACTGCGAG GCCTGTCCTC 
+3 SD Q . E P L , Y P V Q V S S A D A RL 
1001 TGACCAGGAG CCGCTGTACC CAGTGCAGGT CAGCTCTGCG GACGCTCGGC 
ACTGGTCCTC GGCGACATGG GTCACGTCCA GTCGAGACGC CTGCGAGCCG 
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+3 
1051 

+3 
1101 

+3 
1151 

+3 
1201 

+3 
1251 

+3 
1301 

+3 
1351 

+3 
1401 



• L M V 



W 



s s 



• l_ IVI V ' r „ , — , « f « ■ 

tcatg'gtc'tt tgacaagacg gaagggacgt ggcggctgct gtgctcctcg 
Igtaccagaa actgttctgc cttccctgca ccgccgacga cacgaggagc 



S N 



+3 
1451 

+3 
1501 

+3 
1551 

+3 
1601 



+3 

1651 

+3 
1701 

+3 
1751 



CGCTCTAACG CCAGGGTAGC CGGACTCAGC TGCGAGGAGA TGGGCTTCCT 
GCGAGATTGC GGTCCCATCG GCCTGAGTCG ACGCTCCTCT ACCCGAAGGA 
. L R A L THS E LP V RT A GANG 

™ccgca*ctg ACCCACTCCG AGCTGGACGT gcgaacggcg GGCGCCAATG 
ScccSSc Sgtgaggc tcgacctgca cgcttgccgc ccgcggttac 

. g T S G FF C VD E G RL P HT 

gcacg'tcggg cttcttctgt gtggacgagg ggaggctgcc ccacacccag 
cctgcIgccc gaagaagaca cacctgctcc cctccgacgg ggtgtgggtc 

R L L E V I S V C D C P R G R FL- 

'aggctgctgg ag^atctc cgtgtgtgat tgccccagag gccgtttctt 
Sgacgacc tccagtagag gcacacacta acggggtctc cggcaaagaa 
. l a a.cqd cgrrklp vddp 
ggccgcc'atc tgccaagact gtggccgcag gaagctgccc gtggacgacg 
ccScgSIg acggttctga caccggcgtc cttcgacggg cacctgctgc 

• o D K IVGG RDTSLG RWPW 

ACGACAAGAT CGTGGGAGGC CGGGACACCA GCTTGGGCCG GTGGCCGTGG 
TGCTGTTCTA GCACCCTCCG GCCCTGTGGT CGAACCCGGC CACCGGCACC 
q V S L RY D GA H LCG G S L L 
caa'gtcagcc TTCGCTATGA TGGAGCACAC CTCTGTGGGG GATCCCTGCT 

g^SScgg aagcgatact acctcgtgtg gagacacccc ctagggacga 
. l s g d wvl t aa h cf p ernr 
ctccggg'gac "tgggtgctga cagccgccca ctgcttcccg gagcggaacc 
gaggcccctg acccacgact gtcggcgggt gacgaagggc ctcgccttgg 
• r V L S RW r vfa g av a qas 
gggtcctg'tc ccgatggcga gtgtttgccg gtgccgtggc ccaggcctct 
cccaggacag ggctaccgct cacaaacggc cacggcaccg ggtccggaga 

P H G L QL G VQ A VVY H G G Y- 
CCCCACGGTC TGCAGCTGGG GGTGCAGGCT GTGGTCTACC ACGGGGGCTA 

gggSgccag acgtcgaccc ccacgtccga caccagatgg tgcccccgat 

. Y L P F RDP N SE E NSN DIAL 

TCTTCCCTTT CGGGACCCCA ACAGCGAGGA GAACAGCAAC GATATTGCCC 

IgIIgggaaa gccctggggt tgtcgctcct cttgtcgttg ctataacggg 

LVH L SS P LPL T EY I Q P V 



TGGTC'CACCT CTCCAGTCCC CTGCCCCTCA CAGAATACAT CCAGCCTGTG 

IccIggJggI gaggtcaggg gacggggagt gtcttatgta ggtcggacac 

p A A G Q A L V P G K 1 . . , , 

tgcctc'ccag ctgccggcca ggccctggtg gatggcaaga tctgtaccgt 
IcggaggSc gacggccggt ccgggaccac ctaccgttct agacatggca 

V T G W GNT O YY G QQ A GVLQ 
gacgggctgg GGCAACACGC AGTACTATGG CCAACAGGCC GGGGTACTCC 
Sgcccgacc CCGTTGTGCG TCATGATACC GGTTGTCCGG CCCCATGAGG 



pIRESpuro2W/hepEK_k 



+3 


•Q E A R 


V P I 


1 S N D V C N 


GAD 


1801 


AGGAGGCTCG 


AGTCCCCATA 


ATCAGCAATG 


ATGTCTGCAA 


TGGCGCTGAT 




TCCTCCGAGC 


TC AGGGGT AT 


TAGTCGTTAC 


TACAGACGTT 


ACCGCGACTA 


+3 


F Y G N Q 1 K 


P K M 


F C A G Y P E 


1851 


TTCTATGGAA 


ACCAGATCAA 


GCCCAAGATG 


TTCTGTGCTG 


GCTACCCCGA 




AAG AT AC CTT 


TGGTCTAGTT 


CGGGTTCTAC 


AAGACACGAC 


CGATGGGGCT 


+3 


•EGG 1 


D A C Q G D S 


G G P 


F V C E 


1901 


GGGTGGCATT 


GATGCCTGCC 


AGGGCGACAG 


CGGTGGTCCC 


TTTGTGTGTG 




CCCACCGTAA 


CTACGGACGG 


TCCCGCTGTC 


GCCACCAGGG 


AAACACACAC 


+3 


■EDS 1 


S R T 


P R W 1 


R L C G 


1 V S 


1951 


AGGACAGCAT 


CTCTCGGACG 


CCACGTTGGC 


GGCTGTGTGG 


C ATTGTG AGT 




TCCTGTCGTA 


GAGAGCCTGC 


GGTGCAACCG 


CCGACACACC 


GTAACACTCA 


+3 


W G T 


G C A L 


A Q K 


P G V 


Y T K V- 


9 001 


TGGGGCACTG 


GCTGTGCCCT 


GGCCCAGAAG 


CCAGGCGTCT 


AC AC C AAAGT 




ACCCCGTGAC 


CGACACGGGA 


CCGGGTCTTC 


GGTCCGCAGA 


TGTGGTTTCA 


+3 


•V S D F 


R E W 


1 F Q A 


1 K T 


H S E A 




CAGTGACTTC 


CGGGAGTGGA 


TCTTCCAGGC 


CATAAAGACT 


CACTCCGAAG 




GTCACTGAAG 


GCCCTCACCT 


AGAAGGTCCG 


GTATTTCTGA 


GTGAGGCTTC 


+3 


•AS G W 


1 V T Q 


L E F 


G K P 1 


P N P 


2101 


CCAGCGGCAT 


GGTGACCCAG 


CTCGAATTCG 


GTAAGCCTAT 


CCCTAACCCT 




GGTCGCCGTA 


CCACTGGGTC 


GAGCTTAAGC 


CATTCGGATA 


GGGATTGGGA 


+3 


L L G 


L D S T 


R T G 


H H H 


H H H 


2151 


CTCCTCGGTC 


TCGATTCTAC 


GCGTACCGGT 


CATCATCACC 


ATCACCATTG 




GAGGAGCCAG 


AGCTAAGATG 


CGCATGGCCA 


GTAGTAGTGG 


TAGTGGTAAC 


2201 


AGTTTAAAGC 


GGCCGCATAG 


ATAACTGATC 


CAGTGTGCTG 


GAATTAATTC 




TCAAATTTCG 


CCGGCGTATC 


TATTGACTAG 


GTCACACGAC 


CTTAATTAAG 


2251 


GCTGTCTGCG 


AGGGCCAGCT 


GTTGGGGTGA 


GTACTCCCTC 


TCAAAAGCGG 




C G AC AG AC GC 


TCCCGGTCGA 


CAACCCCACT 


CATGAGGGAG 


AGTTTTCGCC 



2301 GCATGACTTC TGCGCTAAGA TTGTCAGTTT CCAAAAACGA GGAGGATTTG 
C GT AC TG AAG ACGCGATTCT AACAGTCAAA GGTTTTTGCT CCTCCTAAAC 

2351 ATATTCACCT GGCCCGCGGT GATGCCTTTG AGGGTGGCCG CGTCCATCTG 
TATAAGTGGA CCGGGCGCCA CTACGGAAAC TCCCACCGGC GCAGGTAGAC 



2401 GTCAGAAAAG ACAATCTTTT TGTTGTCAAG CTTGAGGTGT GGCAGGCTTG 

CAGTCTTTTC TGTTAGAAAA ACAACAGTTC GAACTCCACA CCGTCCGAAC 
2451 AGATCTGGCC AT AC ACTTG A GTGACAATGA CATCCACTTT GCCTTTCTCT 

TCTAGACCGG TATGTGAACT CACTGTTACT GTAGGTGAAA CGGAAAGAGA 
2501 C C AC AGGTGT CCACTCCCAG GTCCAACTGC AGGTCGAGCA TGCATCTAGG 

GGTGTCCACA GGTGAGGGTC CAGGTTGACG TCCAGCTCGT ACGTAGATCC 
2551 GCGGCCAATT CCGCCCCTCT CCCTCCCCCC CCCCTAACGT TACTGGCCGA 

CGCCGGTTAA GGCGGGGAGA GGGAGGGGGG GGGGATTGCA ATGACCGGCT 
2601 AGCCGCTTGG AATAAGGCCG GTGTGCGTTT GTCTATATGT GATTTTCCAC 

TCGGCGAACC TTATTCCGGC CACACGCAAA CAGATATACA CTAAAAGGTG 
2651 CATATTGCCG TCTTTTGGCA ATGTGAGGGC CCGGAAACCT GGCCCTGTCT 

GTATAACGGC AGAAAACCGT TACACTCCCG GGCCTTTGGA C C GGG AC AG A 
2701 TCTTGACGAG CATTCCTAGG GGTCTTTCCC CTCTCGCCAA AGGAATGCAA 

AGAACTGCTC GTAAGGATCC CCAGAAAGGG GAGAGCGGTT TCCTTACGTT 



RfiiOWc V3 
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27 51 GGTCTGTTGA ATGTCGTGAA GGAAGCAGTT CCTCTGGAAG CTTCTTGAAG 
CCAGACAACT TACAGCACTT CCTTCGTCAA GGAGACCTTC GAAGAACTTC 
2801 ACAAACAACG TCTGTAGCGA CCCTTTGCAG GCAGCGGAAC CCCCCACCTG 
TGTTTGTTGC AGACATCGCT GGGAAACGTC CGTCGCCTTG GGGGGTGGAC 
2851 GCGACAGGTG CCTCTGCGGC CAAAAGCCAC GTGTATAAGA TACACCTGCA 
CGCTGTCCAC GG AG AC GC C G GTTTTCGGTG C AC ATATTC T ATGTGGACGT 
2901 AAGGCGGCAC AACCCCAGTG C C AC GTTGTG AGTTGGATAG TTGTGGAAAG 
TTCCGCCGTG TTGGGGTCAC GGTGCAACAC TCAACCTATC AACACCTTTC 
2951 AGTCAAATGG CTCTCCTCAA GCGTATTCAA CAAGGGGCTG AAGGATGCCC 
TCAGTTTACC GAGAGGAGTT CGCATAAGTT GTTCCCCGAC TTCCTACGGG 
3001 AG AAGGT AC C CCATTGTATG GGATCTGATC TGGGGCCTCG GTGCACATGC 
TCTTCCATGG GGTAACATAC CCTAGACTAG ACCCCGGAGC C ACGTGT AC G 
3051 TTTACATGTG TTT AGTC GAG GTTAAAAAAA CGTCTAGGCC CCCCGAACCA 
AAATGTACAC AAATCAGCTC CAATTTTTTT GCAGATCCGG GGGGCTTGGT 
3101 CGGGGACGTG GTTTTCCTTT GAAAAACACG ATGATAAGCT TGCCACAACC 
GCCCCTGCAC CAAAAGGAAA CTTTTTGTGC TACTATTCGA ACGGTGTTGG 
3151 CACAAGGAGA CGACCTTCCA TG AC CGAGT A CAAGCCCACG GTGCGCCTCG 
GTGTTCCTCT GCTGGAAGGT ACTGGCTCAT GTTCGGGTGC CACGCGGAGC 
3201 CCACCCGCGA CGACGTCCCC CGGGCCGTAC GCACCCTCGC CGCCGCGTTC 
GGTGGGCGCT GCTGCAGGGG GCCCGGCATG CGTGGGAGCG GCGGCGCAAG 
3251 GCCGACTACC CCGCCACGCG CCACACCGTC GACCCGGACC GCCACATCGA 
CGGCTGATGG GGCGGTGCGC GGTGTGGCAG CTGGGCCTGG CGGTGTAGCT 
3301 GCGGGTCACC GAGCTGCAAG AACTCTTCCT CACGCGCGTC GGGCTCGACA 
CGCCCAGTGG CTCGACGTTC TTGAGAAGGA GTGCGCGCAG CCCGAGCTGT 
3351 TCGGC AAGGT GTGGGTC GCG GACGACGGCG CCGCGGTGGC GGTCTGGACC 
AGCCGTTCCA CACCCAGCGC CTGCTGCCGC GGCGCCACCG CCAGACCTGG 
3401 ACGCCGGAGA GCGTCGAAGC GGGGGCGGTG TTCGCCGAGA TCGGCCCGCG 
TGCGGCCTCT CGCAGCTTCG CCCCCGCCAC AAGCGGCTCT AGCCGGGCGC 
3451 CATGGCCGAG TTGAGCGGTT CCCGGCTGGC CGCGCAGCAA CAGATGGAAG 
GTACCGGCTC AACTCGCCAA GGGCCGACCG GCGCGTCGTT GTCTACCTTC 
3501 GCCTCCTGGC GCCGCACCGG CCCAAGGAGC CCGCGTGGTT CCTGGCCACC 
CGGAGGACCG CGGCGTGGCC GGGTTCCTCG GGCGCACCAA GGACCGGTGG 
3 551 GTCGGCGTCT CGCCCGACCA CCAGGGCAAG GGTCTGGGCA GCGCCGTCGT 
CAGCCGCAGA GCGGGCTGGT GGTCCCGTTC CCAGACCCGT CGCGGCAGCA 
3 601 GCTCCCCGGA GTGGAGGCGG CCGAGCGCGC CGGGGTGCCC GCCTTCCTGG 
CGAGGGGCCT CACCTCCGCC GGCTCGCGCG GCCCCACGGG CGGAAGGACC 
3651 AGACCTCCGC GCCCCGCAAC CTCCCCTTCT ACGAGCGGCT CGGCTTCACC 
TCTGGAGGCG C GGGGC GTTG GAGGGGAAGA TGCTCGCCGA GCCGAAGTGG 
3701 GTCACCGCCG ACGTCGAGTG CCCGAAGGAC CGCGCGACCT GGTGCATGAC 
CAGTGGCGGC TGCAGCTCAC GGGCTTCCTG GCGCGCTGGA CCACGTACTG 
37 51 CCGCAAGCCC GGTGCCTGAC GCCCGCCCCA CGACCCGCAG CGCCCGACCG 
GGCGTTCGGG CCACGGACTG CGGGCGGGGT GCTGGGCGTC GCGGGCTGGC 
3801 AAAGGAGCGC ACGACCCCAT GGCTCCGACC GAAGCCGACC CGGGCGGCCC 
TTTCCTCGCG TGCTGGGGTA CCGAGGCTGG CTTCGGCTGG GCCCGCCGGG 
3 851 CGCCGACCCC GCACCCGCCC CCGAGGCCCA CCGACTCTAG ACTCGAGATC 
GCGGCTGGGG CGTGGGCGGG GGCTCCGGGT GGCTGAGATC TGAGCTCTAG 
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3 9 01 GATAATCAAC CTCTGGATTA CAAAATTTGT GAAAGATTGA CTGGTATTCT 
CTATTAGTTG GAGACCTAAT GTTTTAAACA CTTTCTAACT G AC C ATAAG A 
3951 TAACTATGTT GCTCCTTTTA CGCTATGTGG ATACGCTGCT TTAATGCCTT 
ATTGATACAA CGAGGAAAAT GCGATACACC TATGCGACGA AATTACGGAA 
4001 TGTATCATGC TATTGCTTCC CGTATGGCTT TCATTTTCTC CTC CTTGT AT 
AC AT AGT AC G ATAACGAAGG GCATACCGAA AGTAAAAGAG GAGGAACATA 
4051 AAATCCTGGT TGCTGTCTCT TTATGAGGAG TTGTGGCCCG TTGTCAGGCA 
TTTAGGACCA AC G AC AG AG A AATACTCCTC AAC AC CGGGC AACAGTCCGT 
4101 ACGTGGCGTG GTGTGCACTG TGTTTGCTGA CGCAACCCCC ACTGGTTGGG 
TGCACCGCAC CACACGTGAC ACAAACGACT GCGTTGGGGG TGACCAACCC 
4151 GCATTGCCAC CACCTGTCAG CTCCTTTCCG GGACTTTCGC TTTCCCCCTC 
CGTAACGGTG GTGGACAGTC GAGGAAAGGC CCTGAAAGCG AAAGGGGGAG 
4201 CCTATTGCCA CGGCGGAACT CATCGCCGCC TGCCTTGCCC GCTGCTGGAC 
GGATAACGGT GCCGCCTTGA GTAGCGGCGG ACGGAACGGG CGACGACCTG 
4251 AGGGGCTCGG CTGTTGGGCA CTGACAATTC CGTGGTGTTG TCGGGGAAAT 
TCCCCGAGCC GACAACCCGT GACTGTTAAG GCACCACAAC AGCCCCTTTA 
43 01 CATCGTCCTT TCCTTGGCTG CTCGCCTGTG TTGCCACCTG GATTCTGCGC 
GTAGCAGGAA AGG AAC C G AC GAGCGGACAC AAC GGTGG AC CTAAGACGCG 
4351 GGGACGTCCT TCTGCTACGT CCCTTCGGCC CTCAATCCAG CGGACCTTCC 
CCCTGCAGGA AGACGATGCA GGGAAGCCGG GAGTTAGGTC GCCTGGAAGG 
4401 TTCCCGCGGC CTGCTGCCGG CTCTGCGGCC TCTTCCGCGT CTTCGCCTTC 
AAGGGCGCCG GACGACGGCC GAGACGCCGG AGAAGGCGCA GAAGCGGAAG 
4451 GCCCTCAGAC GAGTCGGATC TCCCTTTGGG CCGCCTCCCC GCCTGATCGA 
CGGGAGTCTG CTCAGCCTAG AGGGAAACCC GGCGGAGGGG CGGACTAGCT 
4501 TCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT GCCAGCCATC 
AGATCTCGAG CGACTAGTCG GAGCTGACAC GGAAGATCAA CGGTCGGTAG 
4551 TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 
ACAACAAACG GGGAGGGGGC ACGGAAGGAA CTGGGACCTT CCACGGTGAG 
4601 CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT 
GGTGACAGGA AAGGATTATT TTACTCCTTT AACGTAGCGT AACAGACTCA 
4651 AGGTGTCATT CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA 
TCCACAGTAA GATAAGACCC CCCACCCCAC CCCGTCCTGT CGTTCCCCCT 
4701 GGATTGGGAA GACAATAGCA GGCATGCTGG GGATGCGGTG GGCTCTATGG 
CCTAACCCTT CTGTTATCGT CCGTACGACC CCTACGCCAC C C GAG AT AC C 
4751 CTTCTGAGGC GGAAAGAACC AGCTGGGGCT CGAGTGCATT CTAGTTGTGG 
GAAGACTCCG CCTTTCTTGG TCGACCCCGA GCTCACGTAA GATCAACACC 
4801 TTTGTC C AAA CTCATCAATG TATCTTATCA TGTCTGTATA CCGTCGACCT 
AAAC AGGTTT GAGTAGTTAC ATAGAATAGT ACAGACATAT GGCAGCTGGA 
4851 CTAGCTAGAG CTTGGCGTAA TCATGGTCAT AGCTGTTTCC TGTGTGAAAT 
GATCGATCTC G AAC C GC ATT AGTACCAGTA TCGACAAAGG ACACACTTTA 
4901 TGTTATCCGC TCACAATTCC ACACAACATA CGAGCCGGAA GCATAAAGTG 
ACAATAGGCG AGTGTTAAGG TGTGTTGTAT GCTCGGCCTT CGTATTTCAC 
4951 TAAAGCCTGG GGTGCCTAAT GAGTGAGCTA ACTCACATTA ATTGCGTTGC 
ATTTCGGACC CCACGGATTA CTCACTCGAT TGAGTGTAAT TAACGCAACG 
5 001 GCTCACTGCC CGCTTTCCAG TCGGGAAACC TGTCGTGCCA GCTGCATTAA 
CGAGTGACGG GCGAAAGGTC AGCCCTTTGG ACAGCACGGT CGACGTAATT 
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5051 TGAATCGGCC AACGCGCGGG GAGAGGCGGT TTGCGTATTG GGCGCTCTTC 

ACTTAGCCGG TTGCGCGCCC CTCTCCGCCA AACGCATAAC CCGCGAGAAG 
5101 CGCTTCCTCG C TC AC TG AC T CGCTGCGCTC GGTCGTTCGG CTGCGGCGAG 

GCGAAGGAGC GAGTGACTGA GCGACGCGAG CCAGCAAGCC GACGCCGCTC 
5151 CGGTATCAGC TCACTCAAAG GCGGTAATAC GGTTATCCAC AGAATCAGGG 

GCCATAGTCG AGTGAGTTTC CGCCATTATG CCAATAGGTG TCTTAGTCCC 
5201 GATAACGCAG GAAAGAACAT GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA 

CTATTGCGTC CTTTCTTGTA CACTCGTTTT CCGGTCGTTT TCCGGTCCTT 
5251 CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT CCATAGGCTC CGCCCCCCTG 

GGCATTTTTC CGGCGCAACG ACCGCAAAAA GGTATCCGAG GCGGGGGGAC 
5301 ACGAGCATCA CAAAAATCGA CGCTCAAGTC AGAGGTGGCG AAACCCGACA 

TGCTCGTAGT GTTTTTAGCT GCGAGTTCAG TCTCCACCGC TTTGGGCTGT 
5351 GGACTATAAA GAT AC C AGGC GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC 

CCTGATATTT CTATGGTCCG CAAAGGGGGA CCTTCGAGGG AGCACGCGAG 
5401 TCCTGTTCCG ACCCTGCCGC TTACCGGATA CCTGTCCGCC TTTCTCCCTT 

AGGACAAGGC TGGGACGGCG AATGGCCTAT GGACAGGCGG AAAGAGGGAA 
5451 CGGGAAGCGT GGCGCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 

GCCCTTCGCA CCGCGAAAGA GTTACGAGTG CGACATCCAT AGAGTCAAGC 
5501 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 

CACATCCAGC AAGCGAGGTT CGACCCGACA CACGTGCTTG GGGGGCAAGT 
5 551 GCCCGACCGC TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG 

CGGGCTGGCG ACGCGGAATA GGCCATTGAT AGCAGAACTC AGGTTGGGCC 
5601 TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA CAGGATTAGC 

ATTCTGTGCT GAATAGCGGT GACCGTCGTC GGTG AC C ATT GTCCTAATCG 
5 651 AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 

TCTCGCTCCA TACATCCGCC ACGATGTCTC AAGAACTTCA C C AC C GG ATT 
5701 CTACGGCTAC AC T AG AAGG A CAGTATTTGG TATCTGCGCT CTGCTGAAGC 

GATGCCGATG TGATCTTCCT GTCATAAACC ATAGACGCGA GACGACTTCG 
5751 CAGTTACCTT C GG AAAAAG A GTTGGTAGCT CTTGATCCGG CAAACAAACC 

GTCAATGGAA GCCTTTTTCT CAACCATCGA GAACTAGGCC GTTTGTTTGG 
5801 ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 

TGGCGACCAT CGCCACCAAA AAAACAAACG TTCGTCGTCT AATGCGCGTC 
5851 AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG 

TTTTTTTCCT AGAGTTCTTC TAGGAAACTA GAAAAGATGC CCCAGACTGC 
5901 CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT GAGATTATCA 

GAGTCACCTT GCTTTTGAGT GCAATTCCCT AAAACCAGTA CTCTAATAGT 
5951 AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 

TTTTCCTAGA AGTGGATCTA GGAAAATTTA ATTTTTACTT CAAAATTTAG 
6001 AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA 

TTAGATTTCA TATATACTCA TTTGAACCAG ACTGTCAATG GTTACGAATT 
6051 TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT 

AGTCACTCCG TGGATAGAGT CGCTAGACAG ATAAAGCAAG TAGGTATCAA 
6101 GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTT AC CATC 

CGGACTGAGG GGCAGCACAT CTATTGATGC TATGCCCTCC CGAATGGTAG 
6151 TGGCCCCAGT GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG 

ACCGGGGTCA CGACGTTACT ATGGCGCTCT GGGTGCGAGT GGCCGAGGTC 
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6201 ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG C AG AAGTGGT 

TAAATAGTCG TTATTTGGTC GGTCGGCCTT CCCGGCTCGC GTCTTCACCA 
6251 CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC 

GGACGTTGAA ATAGGCGGAG GTAGGTCAGA TAATTAACAA CGGCCCTTCG 
6301 TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT GTTGCCATTG 

ATCTCATTCA TCAAGCGGTC AATTATCAAA CGCGTTGCAA CAACGGTAAC 
6351 CTACAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC 

GATGTCCGTA GCACCACAGT GCGAGCAGCA AACCATACCG AAGTAAGTCG 
6401 TCCGGTTCCC AACGATCAAG GCGAGTTACA TGATCCCCCA TGTTGTGCAA 

AGGCCAAGGG TTGCTAGTTC CGCTCAATGT ACTAGGGGGT ACAACACGTT 
6451 AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG 

TTTTCGCCAA TCGAGGAAGC CAGGAGGCTA GCAACAGTCT TCATTCAACC 
6501 CCGCAGTGTT ATCACTCATG GTTATGGCAG CACTGCATAA TTCTCTTACT 

GGCGTCACAA TAGTGAGTAC CAATACCGTC GTGACGTATT AAGAGAATGA 
6551 GTCATGCCAT CCGTAAGATG CTTTTCTGTG AC TGGTG AGT ACTCAACCAA 

CAGTACGGTA GGCATTCTAC GAAAAGACAC TGACCACTCA TGAGTTGGTT 
6601 GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT TGCCCGGCGT 

CAGTAAGACT CTTATCACAT ACGCCGCTGG CTCAACGAGA ACGGGCCGCA 
6651 CAATACGGGA TAATACCGCG CCACATAGCA GAACTTTAAA AGTGCTCATC 

GTTATGCCCT ATTATGGCGC GGTGTATCGT CTTGAAATTT TCACGAGTAG 
6701 ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT 

TAACCTTTTG CAAGAAGCCC CGCTTTTGAG AGTTCCTAGA ATGGCGACAA 
6751 GAGATCCAGT TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT 

CTCTAGGTCA AGCTACATTG GGTGAGCACG TGGGTTGACT AGAAGTCGTA 
6801 CTTTTACTTT CACCAGCGTT TC TGGGTG AG CAAAAACAGG AAGGC AAAAT 

GAAAATGAAA GTGGTCGCAA AGACCCACTC GTTTTTGTCC TTC CGTTTT A 
6851 GCCGCAAAAA AGGGAATAAG GGCGACACGG AAATGTTGAA TACTCATACT 

CGGCGTTTTT TCCCTTATTC CCGCTGTGCC TTTACAACTT ATGAGTATGA 
6901 CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA 
GAAGGAAAAA GTTATAATAA CTTCGTAAAT AGTCCCAATA ACAGAGTACT 
6951 GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG 
CGCCTATGTA TAAACTTACA TAAATCTTTT TATTTGTTTA TCCCCAAGGC 
7 001 CGCACATTTC CCCGAAAAGT GCCACCTGAC GTC 
GCGTGTAAAG GGGCTTTTCA CGGTGGACTG CAG 
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AG AAC GGT AG 
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7 51 TCAGCACCCA GTCCCCGGAG AGCAGGGATC CCCCACAGAG GTGTGCTCCA 
AGTCGTGGGT CAGGGGCCTC TCGTCCCTAG GGGGTGTC TC C AC AC GAG G T 
-3 TLVWDGS L LSGGC L HAGD 
801 TCATAGCGAA GGCTGACTTG CCACGGCCAC CGGCCCAAGC TGGTGTCCCG 
AGTATCGCTT CCGACTGAAC GGTGCCGGTG GCCGGGTTCG AC C AC AGGGC 
-3 DYR L S V QWPW R GL S TDR 
851 GCCTCCCACG ATCTTGTCGT CGTCGTCCAC GGGCAGCTTC CTGCGGCCAC 
CGGAGGGTGC TAGAACAGCA GCAGCAGGTG CCCGTCGAAG GACGCCGGTG 
-3 GGV I KD D DD V PLKR R G C • 
9 01 AGTCTTGGCA GATGGCGGCC AAGAAACGGC CTCTGGGGCA ATCACACACG 
TCAGAACCGT CTACCGCCGG TTCTTTGCCG GAG AC C C C GT TAGTGTGTGC 
-3 CDQ C I A A L F R G RP C DCVS 
951 GAGATGACCT CCAGCAGCCT CTGGGTGTGG GGCAGCCTCC CCTCGTCCAC 
CTCTACTGGA GGTCGTCGGA GACCCACACC CCGTCGGAGG GGAGCAGGTG 
-3 SIV E LL R QTH P LRG EDV 
1001 ACAGAAGAAG CCCGACGTGC CATTGGCGCC CGCCGTTCGC ACGTCCAGCT 
TGTCTTC TTC GGGCTGC AC G GTAACCGCGG GCGGCAAGCG TGCAGGTCGA 
-3 CFFGSTGNAGATRVDLE- 
1051 CGGAGTGGGT CAGTGCCCTG AGGAAGCCCA TCTCCTCGCA GCTGAGTCCG 
GCCTCACCCA GTC AC GGG AC TCCTTCGGGT AGAGGAGCGT CGACTCAGGC 
-3 • E S H T LAR L FG M EE C SLGA 
1101 GCTACCCTGG CGTTAGAGCG CGAGGAGCAC AGCAGCCGCC ACGTCCCTTC 
C G ATGGG AC C GCAATCTCGC GCTCCTCGTG TCGTCGGCGG TGCAGGGAAG 
-3 • A V R A NS R SSC L LRW TGE 
1151 CGTCTTGTCA AAGACCATGA GCCGAGCGTC CGCAGAGCTG ACCTGCACTG 
GCAGAACAGT TTCTGGT AC T CGGCTCGCAG GCGTCTCGAC TGGACGTGAC 
-3 TKDFVMLRADASSVQVP- 
1201 GGTACAGCGG CTCCTGGTCA CTCCTATCCG GAGCGTCACC AGTGGAACCT 
CCATGTCGCC GAGGACCAGT GAGGATAGGC CTCGCAGTGG TC AC CTTGG A 
-3 PYL P EQD S RD P AD G TSGP 
1251 GGAACCCAGA GCAGCAGTAC CCATAGCAGG AGTGTGTCTG TCTCCATGGT 
CCTTGGGTCT CGTCGTCATG GGTATCGTCC TC AC AC AG AC AGAGGTA CCA 
-3 P V W L LL VWLL L TD T EM 
1301 GGCGATCTGG TACCCAGCTT CTAGAGATCT GACGGTTCAC TAAACGAGCT 
CCGCTAGACC ATGGGTCGAA GATCTCTAGA CTGCCAAGTG ATTTGCTCGA 
1351 CTGCTTATAT AGACCTCCCA CCGTACACGC CTACCGCCCA TTTGCGTCAA 
GACGAATATA TCTGGAGGGT GGCATGTGCG GATGGCGGGT AAACGCAGTT 
1401 CGGGGCGGGG TTATTACGAC ATTTTGGAAA GTCCCGTTGA TTTTGGTGCC 
GCCCCGCCCC AATAATGCTG TAAAACCTTT CAGGGCAACT AAAACCACGG 
1451 AAAACAAACT CCCATTGACG TCAATGGGGT GGAGACTTGG AAATCCCCGT 
TTTTGTTTGA GGGTAACTGC AGTTACCCCA CCTCTGAACC TTTAGGGGCA 
1501 GAGTCAAACC GCTATCCACG CCCATTGGTG TACTGCCAAA ACCGCATCAC 
CTCAGTTTGG CGATAGGTGC GGGTAACCAC ATGACGGTTT TGGCGTAGTG 
1551 CATGGTAATA GCGATGACTA AT AC GT AG AT GTACTGCCAA GTAGGAAAGT 
GTACCATTAT CGCTACTGAT TATGCATCTA CATGACGGTT CATC CTTTC A 
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1601 CCCGTAAGGT CATGTACTGG GCATAATGCC AGGCGGGCCA TTTACCGTCA 

GGGCATTCCA GTACATGACC C GT ATT AC GG TCCGCCCGGT AAATGGCAGT 
1651 TTGACGTCAA TAGGGGGCGG ACTTGGCATA TG AT AC AC TT GATGTACTGC 

AACTGCAGTT ATCCCCCGCC TGAACCGTAT ACTATGTGAA CTACATGACG 
1701 CAAGTGGGCA GTTTACCGTA AATACTCCAC CCATTGACGT CAATGGAAAG 

GTTCACCCGT CAAATGGCAT TTATGAGGTG GGTAACTGCA GTTACCTTTC 
1751 TCCCTATTGG CGTTACTATG GGAACATACG TCATTATTGA CGTCAATGGG 

AGGGATAACC GCAATGATAC CCTTGTATGC AGTAATAACT GCAGTTACCC 
1801 CGGGGGTCGT TGGGCGGTCA GCCAGGCGGG CCATTTACCG TAAGTTATGT 

GCCCCCAGCA ACCCGCCAGT CGGTCCGCCC GGTAAATGGC ATTCAATACA 
1851 AACGCGGAAC TCCATATATG GGCTATGAAC TAATGACCCC GTAATTGATT 

TTGCGCCTTG AGGTATATAC CCGATACTTG ATTACTGGGG CATTAACTAA 
1901 ACTATTAATA ACTAGTCAAT AATCAATGTC AACATGGCGG TCATATTGGA 

TGATAATTAT TGATCAGTTA TTAGTTACAG TTGTACCGCC AGTATAACCT 
1951 CATGAGCCAA TATAAATGTA CATATTATGA TATAGATACA ACGTATGCAA 

GTACTCGGTT ATATTTACAT GTATAATACT ATATCTATGT TGCATACGTT 
2001 TGGCCAATAG CCAATATTGA TTTATGCTAT ATAACCAATG ACTAATATGG 

ACCGGTTATC GGTTATAACT AAATACGATA TATTGGTTAC TG ATT AT AC C 
2051 CTAATTGCCA ATATTGATTC AATGTATAGA TCTTCCATAC CTACCAGTTC 

GATTAACGGT TATAACTAAG TTACATATCT AGAAGGTATG GATGGTCAAG 
2101 TGCGCCTGCA GCAATGCAAC AACGTTGCCC GGATCTGCGA TGATAAGCTG 

ACGCGGACGT CGTTACGTTG TTGCAACGGG CCTAGACGCT ACTATTCGAC 
2151 TCAAACATGA GAATTGGTCG ACTAGCTTGG CACGCCAGAA ATCCGCGCGG 

AGTTTGTACT CTTAACCAGC TGATCGAACC GTGCGGTCTT TAGGCGCGCC 

2201 TGGTTTTTGG GGGTCGGGGG TGTTTGGC AG CCACAGACGC CCGGTGTTCG 

ACCAAAAACC CCCAGCCCCC ACAAACCGTC GGTGTCTGCG GGCCACAAGC 

2251 TGTCGCGCCA GTACATGCGG TCCATGCCCA GGCCATCCAA AAACCATGGG 

ACAGCGCGGT CATGTACGCC AGGTACGGGT CCGGTAGGTT TTTGGT AC C C 

2301 TCTGTCTGCT CAGTCCAGTC GTGGACCAGA CCCCACGCAA CGCCCAAAAT 

AG AC AG AC G A GTCAGGTCAG CACCTGGTCT GGGGTGCGTT GCGGGTTTTA 

2351 AATAACCCCC ACGAACCATA AACCATTCCC CATGGGGGAC CCCGTCCCTA 

TTATTGGGGG TGC TTGGT AT TTGGTAAGGG GTACCCCCTG GGGCAGGGAT 

2401 ACCCACGGGG CCAGTGGCTA TGGCAGGGCC TGCCGCCCCG ACGTTGGCTG 

TGGGTGCCCC GGTC AC CGAT ACCGTCCCGG ACGGCGGGGC TGCAACCGAC 
2451 CGAGCCCTGG GCCTTCACCC GAACTTGGGG GGTGGGGTGG GGAAAAGGAA 

GCTCGGGACC C GG AAGTGGG CTTGAACCCC CCACCCCACC CCTTTTCCTT 
2501 GAAACGCGGG CGTATTGGCC CCAATGGGGT CTCGGTGGGG TATCGACAGA 

CTTTGCGCCC GCATAACCGG GGTTACCCCA GAGCCACCCC ATAGCTGTCT 
2551 GTGCCAGCCC TGGGACCGAA CCCCGCGTTT ATGAACAAAC GACCCAACAC 

CACGGTCGGG ACCCTGGCTT GGGGCGCAAA TACTTGTTTG CTGGGTTGTG 
2601 CCGTGCGTTT TATTCTGTCT TTTTATTGCC GTCATAGCGC GGGTTCCTTC 

GGCACGCAAA ATAAGACAGA AAAATAACGG CAGTATCGCG CCCAAGGAAG 

2 651 CGGTATTGTC TCCTTCCGTG TTTCAGTTAG CCTCCCCCAT CTCCCCTATT 

GCCATAACAG AGGAAGGCAC AAAGTCAATC GGAGGGGGTA GAGGGGATAA 

2701 CCTTTGCCCT CGGACGAGTG CTGGGGCGTC GGTTTCCACT ATCGGCGAGT 

GGAAACGGGA GCCTGCTCAC GACCCCGCAG CCAAAGGTGA TAGCCGCTCA 
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2751 ACTTCTACAC AGCCATCGGT CCAGACGGCC GCGCTTCTGC GGGCGATTTG 
TGAAGATGTG TCGGTAGCCA GGTCTGCCGG CGCGAAGACG CCCGCTAAAC 
2801 TGTACGCCCG ACAGTCCCGG CTCCGGATCG GACGATTGCG TCGCATCGAC 
ACATGCGGGC TGTCAGGGCC GAGGCCTAGC CTGCTAACGC AGCGTAGCTG 
2851 CCTGCGCCCA AGCTGCATCA TCGAAATTGC CGTCAACCAA GCTCTGATAG 
GGACGCGGGT TCGACGTAGT AGCTTTAACG GCAGTTGGTT CGAGACTATC 
2901 AGTTGGTCAA GACCAATGCG GAGCATATAC GCCCGGAGCC GCGGCGATCC 
TCAACCAGTT CTGGTTACGC CTCGTATATG CGGGCCTCGG CGCCGCTAGG 
2951 TGCAAGCTCC GGATGCCTCC GCTCGAAGTA GCGCGTCTGC TGCTCCATAC 
ACGTTC GAGG CCTACGGAGG CGAGCTTCAT CGCGCAGACG ACGAGGTATG 
3 001 AAGCCAACCA CGGCCTCCAG AAGAAGATGT TGGCGACCTC GTATTGGGAA 
TTCGGTTGGT GCCGGAGGTC TTCTTCTACA ACCGCTGGAG CATAACCCTT 
3051 TCCCCGAACA TCGCCTCGCT CCAGTCAATG ACCGCTGTTA TGCGGCCATT 
AGGGGCTTGT AGCGGAGCGA GGTCAGTTAC TGGCGACAAT ACGCCGGTAA 
3101 GTCCGTCAGG ACATTGTTGG AGCCGAAATC CGCGTGCACG AGGTGCCGGA 
CAGGCAGTCC TGTAACAACC TCGGCTTTAG GCGCACGTGC TCCACGGCCT 
3151 CTTCGGGGCA GTCCTCGGCC CAAAGCATCA GCTCATCGAG AGCCTGCGCG 
GAAGCCCCGT CAGGAGCCGG GTTTCGTAGT CGAGTAGCTC TCGGACGCGC 
3201 ACGGACGCAC TGACGGTGTC GTCCATCACA GTTTGCCAGT GATACACATG 
TGCCTGCGTG ACTGCCACAG CAGGTAGTGT CAAACGGTCA CTATGTGTAC 
3251 GGGATCAGCA ATCGCGCATA TGAAATCACG CCATGTAGTG TATTGACCGA 
CCCTAGTCGT TAGCGCGTAT ACTTTAGTGC GGTACATCAC ATAACTGGCT 
3301 TTCCTTGCGG TCCGAATGGG CCGAACCCGC TCGTCTGGCT AAGATCGGCC 
AAGGAACGCC AGGCTTACCC GGCTTGGGCG AGCAGACCGA TTCTAGCCGG 
3351 GCAGCGATCG CATC C ATGGC CTCCGCGACC GGCTGCAGAA CAGCGGGCAG 
CGTCGCTAGC GTAGGTACCG GAGGCGCTGG CCGACGTCTT GTCGCCCGTC 
3401 TTCGGTTTCA GGCAGGTCTT GCAACGTGAC ACCCTGTGCA CGGCGGGAGA 
AAGCCAAAGT CCGTCCAGAA C GTTGC AC TG TGGG AC AC GT GCCGCCCTCT 
3451 TGCAATAGGT CAGGCTCTCG CTGAATTCCC CAATGTCAAG CACTTCCGGA 
ACGTTATCCA GTCCGAGAGC GACTTAAGGG GTTACAGTTC GTGAAGGCCT 
3501 ATCGGGAGCG CGGCCGATGC AAAGTGCCGA TAAACATAAC GATCTTTGTA 
TAGCCCTCGC GCCGGCTACG TTTCACGGCT ATTTGTATTG CTAGAAACAT 
3551 GAAACCATCG GCGCAGCTAT TTACCCGCAG GACATATCCA CGCCCTCCTA 
CTTTGGTAGC CGCGTCGATA AATGGGCGTC CTGTATAGGT GCGGGAGGAT 
3601 CATCGAAGCT GAAAGCACGA GATTCTTCGC CCTCCGAGAG CTGCATCAGG 
GTAGCTTCGA CTTTCGTGCT CTAAGAAGCG GGAGGCTCTC GACGTAGTCC 
3 651 TCGGAGACGC TGTCGAACTT TTCGATCAGA AACTTCTCGA CAGACGTCGC 
AGCCTCTGCG ACAGCTTGAA AAGCTAGTCT TTGAAGAGCT GTCTGCAGCG 
3701 GGTGAGTTCA GGCTTTTTCA TATCTCATTG CCCGGGATCT GCGGCACGCT 
CCACTCAAGT CCGAAAAAGT ATAGAGTAAC GGGCCCTAGA CGCCGTGCGA 
3751 GTTGACGCTG TTAAGC GGGT CGCTGCAGGG TCGCTCGGTG TTCGAGGCCA 
CAACTGCGAC AATTCGCCCA GCGACGTCCC AGCGAGCCAC AAGCTCCGGT 
3801 CACGCGTCAC CTTAATATGC G AAGTGG AC C TGGGACCGCG CCGCCCCGAC 
GTGCGCAGTG G AATT AT AC G CTTCACCTGG ACCCTGGCGC GGCGGGGCTG 
3851 TGCATCTGCG TGTTCGAATT CGCCAATGAC AAGACGCTGG GCGGGGTTTG 
ACGTAGACGC ACAAGCTTAA G C GG TT AC TG TTCTGCGACC CGCCCCAAAC 
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3901 TGTCATCATA GAACTAAAGA CATGCAAATA TATTTCTTCC GGGGACACCG 
ACAGTAGTAT CTTGATTTCT GTACGTTTAT ATAAAGAAGG CCCCTGTGGC 
39 51 CCAGCAAACG CGAGCAACGG GCCACGGGGA TGAAGCAGGG CATGGCGGCC 
GGTCGTTTGC GCTCGTTGCC CGGTGCCCCT ACTTCGTCCC GTACCGCCGG 
4001 GACGCGCTGG GCTACGTCTT GCTGGCGTTC GCGACGCGAG GCTGGATGGC 
CTGCGCGACC CGATGCAGAA CGACCGCAAG CGCTGCGCTC CGACCTACCG 
4051 CTTCCCCATT ATGATTCTTC TCGCTTCCGG CGGCATCGGG ATGCCCGCGT 
GAAGGGGTAA TACTAAGAAG AGCGAAGGCC GCCGTAGCCC TACGGGCGCA 
4101 TGCAGGCCAT GCTGTCCAGG CAGGTAGATG ACGACCATCA GGGACAGCTT 
ACGTCCGGTA CGACAGGTCC GTCCATCTAC TGCTGGTAGT CCCTGTCGAA 
4151 CAAGGATCGC TCGCGGCTCT TACCAGCCTA ACTTCGATCA CTGGACCGCT 
GTTCCTAGCG AGCGCCGAGA ATGGTCGGAT TGAAGCTAGT GACCTGGCGA 
4201 GATCGTCACG GCGATTTATG CCGCCTCGGC GAGCACATGG AAC GGGTTGG 
CTAGCAGTGC CGCTAAATAC GGCGGAGCCG CTCGTGTACC TTGCCCAACC 
4251 CATGGATTGT AGGCGCCGCC CTATACCTTG TCTGCCTCCC CGCGTTGCGT 
GTACCTAACA TCCGCGGCGG GATATGGAAC AG AC GG AGGG GCGCAACGCA 
4301 CGCGGTGCAT GGAGCCGGGC CACCTCGACC TGAATGGAAG CCGGCGGCAC 
GCGCCACGTA CCTCGGCCCG GTGGAGCTGG ACTTACCTTC GGCCGCCGTG 
4351 CTCGCTAACG GATTCACCAC TCCAAGAATT GGAGCCAATC AATTCTTGCG 
GAGCGATTGC CTAAGTGGTG AGGTTCTTAA CCTCGGTTAG TTAAGAACGC 
4401 GAG AAC TGTG AATGCGCAAA CCAACCCTTG GC AG AAC AT A TCCATCGCGT 
CTCTTGACAC TTACGCGTTT GGTTGGGAAC CGTCTTGTAT AGGTAGCGCA 
4451 CCGCCATCTC CAGCAGCCGC ACGCGGCGCA GCAAAAGGCC AGG AAC C GT A 
GGC GGT AG AG GTCGTCGGCG TGCGCCGCGT CGTTTTCCGG TCCTTGGCAT 
4501 AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG 
TTTTCCGGCG CAACGACCGC AAAAAGGTAT CCGAGGCGGG GGGACTGCTC 
4551 CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 
GTAGTGTTTT TAGCTGCGAG TTCAGTCTCC ACCGCTTTGG GCTGTCCTGA 
4601 ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG 
TATTTCTATG GTCCGCAAAG GGGGACCTTC GAGGGAGCAC GC GAG AGG AC 
4651 TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
AAGGCTGGGA CGGCGAATGG CCTATGGACA GGCGGAAAGA GGGAAGCCCT 
4701 AGCGTGGCGC TTTCTCATAG CTCACGCTGT AGGTATCTCA GTTCGGTGTA 
TCGCACCGCG AAAGAGTATC GAGTGCGACA TCCATAGAGT CAAGCCACAT 
4751 GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG 
CCAGCAAGCG AGGTTCGACC CGACACACGT GCTTGGGGGG CAAGTCGGGC 
4801 ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA 
TGGCGACGCG GAATAGGC C A TTGATAGCAG AACTCAGGTT GGGCCATTCT 
4851 CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 
GTGCTGAATA GCGGTGACCG TCGTCGGTGA CCATTGTCCT AATCGTCTCG 
4901 GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG 
C TC C AT AC AT CCGCCACGAT GTCTCAAGAA CTTCACCACC GGATTGATGC 
4951 GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 
CGATGTGATC TTCCTGTCAT AAAC C AT AG A CGCGAGACGA CTTCGGTCAA 
5001 ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC 
TGGAAGCCTT TTTCTCAACC ATCGAGAACT AGGCCGTTTG TTTGGTGGCG 
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5051 TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA 
ACCATCGCCA CCAAAAAAAC AAACGTTCGT CGTCTAATGC GCGTCTTTTT 
5101 AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG 
TTCCTAGAGT TCTTCTAGGA AACTAGAAAA GATGCCCCAG ACTGCGAGTC 
5151 TGG AAC G AAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG 
ACCTTGCTTT TGAGTGCAAT TCCCTAAAAC CAGTACTCTA ATAGTTTTTC 
5201 GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 
CTAGAAGTGG ATCTAGGAAA ATTTAATTTT TACTTCAAAA TTTAGTTAGA 
5251 AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT 
TTTCATATAT AC TC ATTTGA ACCAGACTGT CAATGGTTAC GAATTAGTCA 
5301 GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG 
CTCCGTGGAT AGAGTCGCTA GACAGATAAA GCAAGTAGGT ATCAACGGAC 
5351 ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC 
TGAGGGGCAG CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGAC CGG 
5401 CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC TC C AG ATTTA 
GGTCACGACG TTACTATGGC GCTCTGGGTG CGAGTGGCCG AGGTC TAAAT 
5451 TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC 
AGTCGTTATT TGGTCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG 
5501 AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 
TTGAAATAGG CGGAGGTAGG TCAGATAATT AACAACGGCC CTTCGATCTC 
5551 TAAGTAGTTC GC C AGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTGCA 
ATTCATCAAG CGGTCAATTA TCAAACGCGT TGCAACAACG GTAACGACGT 
5601 GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG 
CCGTAGCACC ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC 
5651 TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG 
AAGGGTTGCT AGTTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC 
5701 CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA 
GCCAATCGAG GAAGCCAGGA GGCTAGCAAC AGTCTTCATT CAACCGGCGT 
5751 GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT 
CACAATAGTG AGTACCAATA CCGTCGTGAC GTATTAAGAG AATGACAGTA 
5801 GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT 
CGGTAGGCAT TCTACGAAAA G AC AC TG AC C ACTCATGAGT TGGTTCAGTA 
5851 TCTGAGAATA GTGTATGCGG C G AC C G AGTT GCTCTTGCCC GGCGTCAACA 
AGACTCTTAT CACATACGCC GCTGGCTCAA CGAGAACGGG CCGCAGTTGT 
5901 CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG 
GCCCTATTAT GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGT AAC C 
5951 AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT 
TTTTGCAAGA AGCCCCGCTT TTGAGAGTTC CTAGAATGGC GACAACTCTA 
6001 CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT 
GGTCAAGCTA CATTGGGTGA GCACGTGGGT TGACTAGAAG TCGTAGAAAA 
6051 ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC 
TGAAAGTGGT CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG 
6101 AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC 
TTTTTTCCCT TATTCCCGCT GTGCCTTTAC AACTTATGAG TATGAGAAGG 
6151 TTTTTC AATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 
AAAAAGTTAT AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT 
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6201 TACATATTTG AATGTATTTA 

ATGTATAAAC TTACATAAAT 
6251 ATTTCCCCGA AAAGTGCCAC 

TAAAGGGGCT TTTCACGGTG 
6301 CATTAACCTA TAAAAATAGG 

GTAATTGGAT ATTTTTATCC 
6351 TTCTCATGTT TGACAGCTTA 

AAGAGTACAA ACTGTCGAAT 
6401 CCCAGCTTTT CTTCCGTTGC 

GGGTCGAAAA GAAGGCAACG 
6451 AGAGGAAGAG GAGGGGTCCC 

TCTCCTTCTC CTCCCCAGGG 
6501 AGGGGGACGA GGAATTTGAG 

TCCCCCTGCT CCTTAAACTC 
6551 GAGGATGTTC AGCGGGAGTT 

CTCCTACAAG TCGCCCTCAA 
6601 CGAGGATGGT TCGGAGGATG 

GCTCCTACCA AGCCTCCTAC 
6651 GCGACCATGA AGGGGATGAG 

CGCTGGTACT TCCCCTACTC 
6701 CTGCACTCCC TGTATTCACT 

GACGTGAGGG ACATAAGTGA 
6751 CTCTTTTAGT GTGAATCATG 

G AG AAAAT C A CACTTAGTAC 

6801 AATGGCCTAG GAGAGAAGGG 

TTACCGGATC CTCTCTTCCC 
6851 CAGTGGACCT CAAAGAAGAG 

GTCACCTGGA GTTTCTTCTC 
6901 GAGGACGAGG ACGAGGAGGC 

CTCCTGCTCC TGCTCCTCCG 
6951 TCAGGGCCAA GACATAGAGA 

AGTCCCGGTT CTGTATCTCT 
7 001 TTGCATTGGC TGCAAAGGGA 

AACGTAACCG ACGTTTCCCT 
7051 CGGGAGGGGC AGGAGCAGGA 

GCCCTCCCCG TCCTCGTCCT 
7101 GGAGGAGGGG CAGGAGGGGC 

CCTCCTCCCC GTCCTCCCCG 
7151 AGGGGCAGGA GCAGGAGGAG 

TCCCCGTCCT CGTCCTCCTC 
7 201 GAGGAGGGGC AGGAGCAGGA 

CTCCTCCCCG TCCTCGTCCT 
7251 GGGGCAGGAG GGGCAGGAGG 

CCCCGTCCTC CCCGTCCTCC 
7 3 01 AGGAGGGGCA GGAGGGGCAG 

TCCTCCCCGT CCTCCCCGTC 



GAAAAATAAA CAAATAGGGG TTCCGCGCAC 
CTTTTTATTT GTTTATCCCC AAGGCGCGTG 
CTGACGTCTA AGAAACCATT ATTATCATGA 
GACTGCAGAT TCTTTGGTAA TAATAGTACT 
CGTATCACGA GGCCCTTTCG TCTTCAAGAA 
GCATAGTGCT CCGGGAAAGC AGAAGTTCTT 
TCATCGATAA GCTGATCCTC ACAGGCCGCA 
AGTAGCTATT CGACTAGGAG TGTCCGGCGT 
CCCAGTAGCA TCTCTGTCTG GTGACCTTGA 
GGGTCATCGT AG AG AC AG AC CACTGGAACT 
GAGAATCCCC ATCCCTACCG TCCAGCAAAA 
CTCTTAGGGG TAGGGATGGC AGGTCGTTTT 
GCCTGGCTTG AGGCTCAGGA CGCAAATCTT 
CGGACCGAAC TCCGAGTCCT GCGTTTAGAA 
TTCCGGGCTG CGAGTAATTG GTGATGAGGA 
AAGGCCCGAC GCTCATTAAC CACTACTCCT 
GGGAATTTTC AGACCTGGAT CTGTCTGACA 
CCCTTAAAAG TCTGGAC CTA G AC AG AC TGT 
GGTGGGGGGG CTGTTGGAGG GGGCAGGAGT 
CCACCCCCCC GACAACCTCC CCCGTCCTCA 
GAGCGTCGTC TAATAAAGAT GTCTATTGAT 
CTCGCAGCAG ATTATTTCTA CAGATAACTA 
TCTGACGAGG GGCCAGGTAC AGGACCTGGA 
AGACTGCTCC CCGGTCCATG TCCTGGACCT 
AGACACATCT GGACCAGAAG GCTCCGGCGG 
TCTGTGTAGA CCTGGTCTTC CGAGGCCGCC 
GGGGTGATAA CCATGGACGA GGACGGGGAA 
CCCCACTATT GGTACCTGCT CCTGCCCCTT 
GGAAGACCAG GAGCCCCGGG CGGCTCAGGA 
CCTTCTGGTC CTCGGGGCCC GCCGAGTCCT 
TGGTGTCCGG AGACCCCAAA AACGTCCAAG 
ACCACAGGCC TCTGGGGTTT TTGCAGGTTC 
CCCACGGTGG AACAGGAGCA GGAGCAGGAG 
GGGTGCCACC TTGTCCTCGT CCTCGTCCTC 
GGGGCAGGAG CAGGAGGAGG GGCAGGAGCA 
CCCCGTCCTC GTCCTCCTCC CCGTCCTCGT 
AGGAGGGGCA GGAGGGGCAG GAGCAGGAGG 
TCCTCCCCGT CCTCCCCGTC CTCGTCCTCC 
GGGCAGGAGG GGCAGGAGGG GCAGGAGCAG 
CCCGTCCTCC CCGTCCTCCC CGTCCTCGTC 
GGAGGGGCAG GAGGGGCAGG AGCAGGAGGA 
CCTCCCCGTC CTCCCCGTCC TCGTCCTCCT 
GGCAGGAGCA GGAGGAGGGG CAGGAGCAGG 
CCGTCCTCGT CCTCCTCCCC GTCCTCGTCC 
GAGCAGGAGG AGGGGCAGGA GGGGCAGGAG 
CTCGTCCTCC TCCCCGTCCT CCCCGTCCTC 
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7351 GGGCAGGAGC AGGAGGAGGG GCAGGAGCAG GAGGGGCAGG AGGGGCAGGA 

CCCGTCCTCG TCCTCCTCCC CGTCCTCGTC CTCCCCGTCC TCCCCGTCCT 
7401 GGGGCAGGAG CAGGAGGGGC AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG 

CCCCGTCCTC GTCCTCCCCG TCCTCGTCCT CCTCCCCGTC CTCCCCGTCC 
7451 AGGGGCAGGA GCAGGAGGGG CAGGAGCAGG AGGGGCAGGA GCAGGAGGGG 

TCCCCGTCCT CGTCCTCCCC GTCCTCGTCC TCCCCGTCCT CGTCCTCCCC 
7501 CAGGAGCAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGGGC AGGAGGGGCA 

GTCCTCGTCC TCCCCGTCCT CCCCGTCCTC GTCCTCCCCG TCCTCCCCGT 
7551 GGAGCAGGAG GGGCAGGAGG GGCAGGAGCA GGAGGAGGGG CAGGAGGGGC 

CCTCGTCCTC CCCGTCCTCC CCGTCCTCGT CCTCCTCCCC GTCCTCCCCG 
7601 AGGAGCAGGA GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG GCAGGAGGGG 

TCCTCGTCCT CCTCCCCGTC CTCCCCGTCC TCGTCCTCCC CGTCCTCCCC 
7 651 CAGGAGCAGG AGGGGCAGGA GGGGCAGGAG CAGGAGGGGC AGGAGGGGCA 

GTCCTCGTCC TCCCCGTCCT CCCCGTCCTC GTCCTCCCCG TCCTCCCCGT 
7701 GGAGCAGGAG GAGGGGCAGG AGCAGGAGGG GCAGGAGCAG GAGGTGGAGG 

CCTCGTCCTC CTCCCCGTCC TCGTCCTCCC CGTCCTCGTC CTCCACCTCC 
7751 CCGGGGTCGA GGAGGCAGTG GAGGCCGGGG TCGAGGAGGT AGTGGAGGCC 

GGCCCCAGCT CCTCCGTCAC CTCCGGCCCC AGCTCCTCCA TCACCTCCGG 
7801 GGGGTCGAGG AGGTAGTGGA GGCCGCCGGG GTAGAGGACG TGAAAGAGCC 

CCCCAGCTCC TCCATCACCT CCGGCGGCCC CATCTCCTGC ACTTTCTCGG 
7851 AGGGGGGGAA GTCGTGAAAG AGCCAGGGGG AGAGGTCGTG GACGTGGAGA 

TCCCCCCCTT CAGCACTTTC TCGGTCCCCC TCTCCAGCAC CTGCACCTCT 
7901 AAAGAGGCCC AGGAGTCCCA GTAGTCAGTC ATCATCATCC GGGTCTCCAC 

TTTCTCCGGG TCCTCAGGGT CATCAGTCAG TAGTAGTAGG CCCAGAGGTG 

7 951 CGCGCAGGCC CCCTCCAGGT AGAAGGCCAT TTTTCCACCC TGTAGGGGAA 

GCGCGTCCGG GGGAGGTCCA TCTTCCGGTA AAAAGGTGGG ACATCCCCTT 
8001 GCCGATTATT TTGAATACCA CCAAGAAGGT GGCCCAGATG GTGAGCCTGA 
CGGCTAATAA AACTTATGGT GGTTCTTCCA CCGGGTCTAC CACTCGGACT 

8 051 CGTGCCCCCG GGAGCGATAG AGCAGGGCCC CGCAGATGAC CCAGGAGAAG 

GCACGGGGGC CCTCGCTATC TCGTCCCGGG GCGTCTACTG GGTCCTCTTC 
8101 GCCCAAGCAC TGGACCCCGG GGTCAGGGTG ATGGAGGCAG GCGCAAAAAA 

CGGGTTCGTG ACCTGGGGCC CCAGTCCCAC TACCTCCGTC CGCGTTTTTT 
8151 GGAGGGTGGT TTGGAAAGCA TCGTGGTCAA GGAGGTTCCA ACCCGAAATT 

CCTCCCACCA AACCTTTCGT AGCACCAGTT CCTCCAAGGT TGGGCTTTAA 
8201 TGAGAACATT GCAGAAGGTT TAAGAGCTCT CCTGGCTAGG AGTCACGTAG 

ACTCTTGTAA CGTCTTCCAA ATTCTCGAGA GGACCGATCC TCAGTGCATC 
8251 AAAGGACTAC CGACGAAGGA ACTTGGGTCG CCGGTGTGTT CGTATATGGA 

TTTCCTGATG GCTGCTTCCT TGAACCCAGC GGCCACACAA GCATATACCT 
8301 GGTAGTAAGA CCTCCCTTTA CAACCTAAGG CGAGGAACTG CCCTTGCTAT 

CCATCATTCT GGAGGGAAAT GTTGG ATTC C GCTCCTTGAC GGGAACGATA 
8351 TCCACAATGT CGTCTTACAC CATTGAGTCG TCTCCCCTTT GGAATGGCCC 

AGGTGTTACA GCAGAATGTG GTAACTCAGC AGAGGGGAAA CCTTACCGGG 
8401 CTGGACCCGG CCCACAACCT GGCCCGCTAA GGGAGTCCAT TGTCTGTTAT 

GACCTGGGCC GGGTGTTGGA CCGGGCGATT CCCTCAGGTA ACAGACAATA 
8451 TTCATGGTCT TTTTACAAAC TC AT AT ATTT GCTGAGGTTT TGAAGGATGC 

AAGTACCAGA AAAATGTTTG AGTATATAAA CGACTCCAAA ACTTCCTACG 
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8501 GATTAAGGAC CTTGTTATGA CAAAGCCCGC TCCTACCTGC AATATCAGGG 

CTAATTCCTG GAACAATACT GTTTCGGGCG AGGATGGACG TTATAGTCCC 
8551 TGACTGTGTG CAGCTTTGAC GATGGAGTAG ATTTGCCTCC CTGGTTTCCA 

ACTGACACAC GTCGAAACTG CTACCTCATC TAAACGGAGG GACCAAAGGT 
8601 CCTATGGTGG AAGGGGCTGC CGCGGAGGGT GATGACGGAG ATGACGGAGA 

GGATACCACC TTCCCCGACG GCGCCTCCCA CTACTGCCTC TACTGCCTCT 
8651 TGAAGGAGGT GATGGAGATG AGGGTGAGGA AGGGCAGGAG TGATGTAACT 

ACTTCCTCCA CTACCTCTAC TCCCACTCCT TCCCGTCCTC ACTACATTGA 
8701 TGTTAGGAGA CGCCCTCAAT CGTATTAAAA GCCGTGTATT CCCCCGCACT 

ACAATCCTCT GCGGGAGTTA GCATAATTTT CGGCACATAA GGGGGCGTGA 
8751 AAAGAATAAA TCCCCAGTAG ACATCATGCG TGCTGTTGGT GTATTTCTGG 

TTTCTTATTT AGGGGTCATC TGTAGTACGC ACGACAACCA CATAAAGACC 
8801 CCATCTGTCT TGTCACCATT TTCGTCCTCC CAACATGGGG CAATTGGGCA 

GGTAGACAGA ACAGTGGTAA AAGCAGGAGG GTTGTACCCC GTTAACCCGT 
8851 TACCCATGTT GTCACGTCAC TCAGCTCCGC GCTCAACACC TTCTCGCGTT 

ATGGGTACAA CAGTGCAGTG AGTCGAGGCG CGAGTTGTGG AAGAGCGCAA 
8901 GGAAAACATT AGCGACATTT AC C TGGTG AG CAATCAGACA TGCGACGGCT 

CCTTTTGTAA TCGCTGTAAA TGGACCACTC GTTAGTCTGT ACGCTGCCGA 
8951 TTAGCCTGGC CTCCTTAAAT TCACCTAAGA ATGGGAGCAA CCAGCATGCA 

AATCGGACCG GAGGAATTTA AGTGGATTCT TACCCTCGTT GGTCGTACGT 
9001 GGAAAAGGAC AAGCAGCGAA AATTCACGCC CCCTTGGGAG GTGGCGGCAT 

CCTTTTCCTG TTCGTCGCTT TTAAGTGCGG GGGAACCCTC CACCGCCGTA 
9051 ATGCAAAGGA TAGCACTCCC ACTCTACTAC TGGGTATCAT ATGCTGACTG 

TACGTTTCCT ATCGTGAGGG TGAGATGATG ACCCATAGTA T AC G AC TG AC 
9101 TATATGCATG AGGATAGCAT ATGCTACCCG GATACAGATT AGGATAGCAT 

ATATACGTAC TCCTATCGTA TACGATGGGC CTATGTCTAA TCCTATCGTA 
9151 ATACTACCCA GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT 

TATGATGGGT CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA 
9201 AGGATAGCCT ATGCTACCCA GATATAAATT AGGATAGCAT ATACTACCCA 

TCCTATCGGA TACGATGGGT C T AT ATTTAA TCCTATCGTA TATGATGGGT 
9251 GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT AGGATAGCCT 

CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA TCCTATCGGA 
9301 ATGCTACCCA GATATAGATT AGGATAGCAT ATGCTACCCA GATATAGATT 

TACGATGGGT CTATATCTAA TCCTATCGTA TACGATGGGT CTATATCTAA 
9351 AGGATAGCAT ATGCTATCCA GATATTTGGG TAGTATATGC TACCCAGATA 

TCCTATCGTA T AC G AT AGGT CTATAAACCC ATCATATACG ATGGGTCTAT 
9401 TAAATTAGGA TAGCATATAC TACCCTAATC TCTATTAGGA TAGCATATGC 

ATTTAATCCT ATCGTATATG ATGGGATTAG AGATAATCCT ATCGTATACG 
9451 TACCCGGATA CAGATTAGGA TAGCATATAC TACCCAGATA TAGATTAGGA 

ATGGGCCTAT GTCTAATCCT ATCGTATATG ATGGGTCTAT ATCTAATCCT 
9501 TAGCATATGC TACCCAGATA TAGATTAGGA TAGCCTATGC TACCCAGATA 

ATCGTATACG ATGGGTCTAT ATCTAATCCT ATCGGATACG ATGGGTCTAT 
9551 TAAATTAGGA TAGCATATAC TACCCAGATA TAGATTAGGA TAGCATATGC 

ATTTAATCCT ATCGTATATG ATGGGTCTAT ATCTAATCCT ATCGTATACG 
9 601 TACCCAGATA TAGATTAGGA TAGCCTATGC TACCCAGATA TAGATTAGGA 

ATGGGTCTAT ATCTAATCCT ATCGGATACG ATGGGTCTAT ATCTAATCCT 
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9 651 TAGCATATGC TATCCAGATA 

ATCGTATACG ATAGGTCTAT 
9701 TTAGCCCACC GTGCTCTCAG 

AATCGGGTGG CACGAGAGTC 
9751 TGTGCTTGGC GCTCAGGCGC 

ACACGAACCG CGAGTCCGCG 
9801 GCAATCGCGC CCCTATCTTG 

CGTTAGCGCG GGGATAGAAC 
9851 CGGGGTGC C A TTAGTGGTTT 

GCCCCACGGT AATCACCAAA 
9901 GCGGGGTTAC AATCAGCCAA 

CGCCCCAATG TTAGTCGGTT 
9951 CCGCAGGGCG GCGTGTGGGG 

GGCGTCCCGC CGCACACCCC 
10001 AAAAAAAGAG TGGCCACTTG 

TTTTTTTCTC ACCGGTGAAC 
10051 CCCCGTTTAA TTTTCGGGGG 

GGGGCAAATT AAAAGCCCCC 
10101 GTCGGCGTCC ACTCTCTTTC 

CAGCCGCAGG TGAGAGAAAG 
10151 GTTCACCTGT CTTGGTCCCT 

CAAGTGGACA GAACCAGGGA 
10201 CATATTGCAC TAGGATTATG 

GTATAACGTG ATCCTAATAC 
10251 TGGACATCCA GTCTTTACGG 

ACCTGTAGGT CAGAAATGCC 
10301 AAAGATATTC AGAATGTTTC 

TTTCTATAAG TCTTACAAAG 
10351 GTTTGTGAGG GTTATATTGG 

CAAACACTCC CAATATAACC 
10401 CCGTCCAAAT TTTATTCTGG 

GGCAGGTTTA AAATAAGACC 
10451 CCTCACATAC ACCTTACTGT 

GGAGTGTATG TGGAATGACA 
10501 ACGAAGGAGA ATGAAGAAGC 

TGCTTCCTCT TACTTCTTCG 
10551 CTCCTTGATC TTCAGCCACT 

GAGGAACTAG AAGTC GGTG A 
10601 CGTGGAATCC TGACCCCATG 

GC AC CTTAGG ACTGGGGTAC 
10651 GAGATATCGC TGTTCCTTAG 

CTCTATAGCG ACAAGGAATC 
10701 TATGCTTCCC GTTGGGTAAC 

ATACGAAGGG CAACCCATTG 
10751 AG TAT AT ACT ACTACCCGGG 

TCATATATGA TGATGGGCCC 



TTTGGGTAGT ATATGCTACC CATGGCAACA 
AAACCCATCA TAT AC G ATGG GTACCGTTGT 
CGACCTCGTG AATATGAGGA CCAACAACCC 
GCTGGAGCAC TTATACTCCT GGTTGTTGGG 
AAGTGTGTGT AATTTGTCCT CCAGATCGCA 
TTCACACACA TTAAACAGGA GGTCTAGCGT 
GCCCGCCCAC CTACTTATGC AGGTATTCCC 
CGGGCGGGTG GATGAATACG TCCATAAGGG 
TGTGGGCAAG TGGTTTGACC GCAGTGGTTA 
ACACCCGTTC ACCAAACTGG CGTCACCAAT 
GTTATTACAC CCTTATTTTA CAGTCCAAAA 
CAATAATGTG GGAATAAAAT GTCAGGTTTT 
GCTGACGCGT GCCCCCACTC CACAATTTCA 
CGACTGCGCA CGGGGGTGAG GTGTTAAAGT 
TCTTTGTTTA TGGGCCCCAT TGGCGTGGAG 
AGAAACAAAT ACCCGGGGTA ACCGCACCTC 
TGTTAGAGAC AACCAGTGGA GTCCGCTGCT 
ACAATCTCTG TTGGTC AC C T CAGGCGACGA 
CCCTTGTTAC AAATAGAGTG TAACAACATG 
GGGAACAATG TTTATCTCAC ATTGTTGTAC 
GCCTGGGACA CATCTTAATA ACCCCAGTAT 
CGGACCCTGT GTAGAATTAT TGGGGTCATA 
TGTTGCCCAT AGCCATAAAT TC GTGTG AG A 
ACAACGGGTA TCGGTATTTA AGCACACTCT 
CTTGTCCCCA CCCCATGGAT TTCTATTGTT 
GAACAGGGGT GGGGT AC C T A AAGATAACAA 
ATTCCTACAC TAGTATTTAT TGCCCAAGGG 
TAAGGATGTG ATCATAAATA ACGGGTTCCC 
TGTCATAGCA CAATGCCACC ACTGAACCCC 
ACAGTATCGT GTT AC GGTGG TGACTTGGGG 
GGGCGTCACC TGAAACCTTG TTTTCGAGCA 
CCCGCAGTGG ACTTTGGAAC AAAAGCTCGT 
TCACAACTCA GCAGTTATTC TATTAGCTAA 
AGTGTTGAGT CGTCAATAAG ATAATCGATT 
AGGCGAAGAT TCAGGAGAGT TCACTGCCCG 
TCCGCTTCTA AGTCCTCTCA AGTGACGGGC 
GCCCTTGTGA CTAAAATGGT TCACTACCCT 
CGGGAACACT GATTTTACCA AGTGATGGGA 
TAAATAAAAC CGTGACAGCT CATGGGGTGG 
ATTTATTTTG GCACTGTCGA GTACCCCACC 
GACCCTTTTA CTAACCCTAA TTCGATAGCA 
CTGGGAAAAT GATTGGGATT AAGCTATCGT 
ATATGCTATT GAATTAGGGT TAGTCTGGAT 
TAT AC G AT AA CTTAATCCCA ATCAGACCTA 
AAGCATATGC TACCCGTTTA GGGTTAACAA 
TTCGTATACG ATGGGCAAAT CCCAATTGTT 
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10801 GGGGGCCTTA TAAACACTAT 

CCCCCGGAAT ATTTGTGATA 
10851 GGTAGCTACA CAGGCCCCTC 

CCATCGATGT GTCCGGGGAG 
10901 TTCCTGGGCC CCTGGGAGGT 

AAGGACCCGG GGACCCTCCA 
10951 CAGCCAAGAG TTACACATAA 

GTCGGTTCTC AATGTGTATT 
11001 CAAAGTCTGC TCCAGGATGA 

GTTTC AG AC G AGGTCCTACT 
11051 CCATTTATAA GGATGTCAAC 

GGTAAATATT CCTACAGTTG 
11101 CCCCCCCGTG TCACATGTGG 

GGGGGGGCAC AGTGT AC AC C 
11151 CAACTGAAGG GATTACATGC 

GTTGACTTCC CTAATGTACG 
11201 GTAGTCAGGT TTAGTTCGTC 

CATCAGTCCA AATCAAGCAG 
11251 TGGCTGCGCC CCGACACCCG 

ACCGACGCGG GGCTGTGGGC 
11301 TGTCTGCTCC CGGCATCCGC 

ACAGACGAGG GCCGTAGGCG 
11351 CTGCATGTGT CAGAGGTTTT 

GACGTACACA GTCTCCAAAA 
11401 CGGATCATAA TCAGCCATAC 

GCCTAGTATT AGTCGGTATG 
11451 AAACCTCCCC ACCTCCCCCT 

TTTGGAGGGG TGGAGGGGGA 
11501 TGTTGTTAAC TTGTTTATTG 

AC AAC AATTG AACAAATAAC 
11551 GCATCACAAA TTTCACAAAT 

CGTAGTGTTT AAAGTGTTTA 
11601 GGTTTGTCCA AACTCATCAA 

CCAAACAGGT TTGAGTAGTT 
11651 AGGCGGGGAG GCGGCCCAAA 

TCCGCCCCTC CGCCGGGTTT 
117 01 GAAGACGCGG AAG AGGC CGC 

CTTCTGCGCC TTCTCCGGCG 
11751 TCCGCTGGAT TGAGGGCCGA 

AGGCGACCTA ACTCCCGGCT 
11801 AATCCAGGTG GCAACACAGG 

TTAGGTC C AC CGTTGTGTCC 
11851 CCGACAACAC C AC GG AATTG 

GGCTGTTGTG GTGCCTTAAC 
11901 CAGCGGGCAA GGCAGGCGGC 

GTCGCCCGTT CCGTCCGCCG 



TGCT AATGC C CTCTTGAGGG TCCGCTTATC 
AC G ATT AC GG GAG AAC TC C C AGGCGAATAG 
TGATTGACGT TGGTGTAGCC TCCCGTAGTC 
ACTAACTGCA ACCACATCGG AGGGCATCAG 
ACATGTCCCC CAGCATTGGT GTAAGAGCTT 
TGTACAGGGG GTCGTAACCA CATTCTCGAA 
AGGCAATGTT GTGTTGCAGT CCACAGACTG 
TCCGTTACAA CACAACGTCA GGTGTCTGAC 
AAGCCACTCA GTGTTGGCAA ATGTGCACAT 
TTCGGTGAGT CACAACCGTT TACACGTGTA 
TACAGTCAGA GAACCCCTTT GTGTTTGGTC 
ATGTCAGTCT CTTGGGGAAA CACAAACCAG 
AACAGGGCCC AGTTGG C AAG TTGT AC C AAC 
TTGTCCCGGG TCAACCGTTC AACATGGTTG 
ACTGCCCCGC GAAGAAGGGG CAGAGATGCC 
TGACGGGGCG CTTCTTCCCC GTCTCTACGG 
CGGCGGCGGG GCTCTAGAGT CGACCGGTCA 
GCCGCCGCCC CGAGATCTCA GCTGGCCAGT 
CCAACACCCG CTGACGCGCC CTGACGGGCT 
GGTTGTGGGC GACTGCGCGG GACTGCCCGA 
TTACAGACAA GCTGTG AC CG TCTCCGGGAG 
AATGTC TGTT CGACACTGGC AGAGGCCCTC 
CACCGTCATC ACCGAAACGC GCGAGGCAGC 
^GTGGCAGTAG TGGCTTTGCG CGCTCCGTCG 
CACATTTGTA GAGGTTTTAC TTGCTTTAAA 
GTGTAAACAT CTCCAAAATG AAC G AAATTT 
GAACCTGAAA CATAAAATGA ATGCAATTGT 
CTTGGACTTT GTATTTTACT TACGTTAACA 
CAGCTTATAA TGGTTACAAA TAAAGCAATA 
GTCGAATATT AC C AATGTTT ATTTCGTTAT 
AAAGCATTTT TTTCACTGCA TTCTAGTTGT 
TTTCGTAAAA AAAGTG AC GT AAGATCAACA 
TGTATCTTAT CATGTCTGGA TCCCACGTGC 
AC AT AG AAT A GTACAGACCT AGGGTGCACG 
GGGAGATCCG ACTCGTCTGA GGGCGAAGGC 
CCCTCTAGGC TGAGCAGACT CCCGCTTCCG 
AG AGC CGGC A GCAGGCCGCG GGAAGGAAGG 
TCTCGGCCGT CGTCCGGCGC CCTTCCTTCC 
AGGGACGTAG CAGAAGGACG TCCCGCGCAG 
TCCCTGCATC GTCTTCCTGC AGGGCGCGTC 
CGAGCAGCCA AGGAAAGGAC GATGATTTCC 
GCTCGTCGGT TCCTTTCCTG CTACTAAAGG 
TCAGTGCCCA AC AGC CG AGC CCCTGTCCAG 
AGTCACGGGT TGTCGGCTCG GGGACAGGTC 
GATGAGTTCC GCCGTGGCAA TAGGGAGGGG 
CTACTCAAGG CGGCACCGTT ATCCCTCCCC 
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11951 GAAAGCGAAA GTCCCGGAAA GGAGCTGACA GGTGGTGGCA ATGCCCCAAC 

CTTTCGCTTT CAGGGCCTTT CCTCGACTGT CCACCACCGT TACGGGGTTG 

12001 CAGTGGGGGT TGCGTCAGCA AACACAGTGC ACACCACGCC ACGTTGCCTG 

GTCACCCCCA ACGCAGTCGT TTGTGTCACG TGTGGTGCGG TGCAACGGAC 

12 051 ACAACGGGCC ACAACTCCTC ATAAAGAGAC AGCAACCAGG ATTTATACAA 

TGTTGCCCGG TGTTGAGGAG TATTTCTCTG TCGTTGGTCC TAAATATGTT 

12101 GGAGGAGAAA ATGAAAGCCA TACGGGAAGC AATAGCATGA TACAAAGGCA 

CCTCCTCTTT TACTTTCGGT ATGCCCTTCG TTATCGTACT ATGTTTCCGT 

12151 TTAAAGCAGC GTATCCACAT AGCGTAAAAG GAGCAACATA GTTAAGAATA 

AATTTCGTCG CATAGGTGTA TCGCATTTTC CTCGTTGTAT CAATTCTTAT 

12201 CCAGTCAATC TTTCACAAAT TTTGTAATCC AGAGGTTGAT TC 

GGTCAGTTAG AAAGTGTTTA AAACATTAGG TCTCCAACTA AG 



12. 
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1 TCGAGCGGCC GCTTTAAACT CAATGGTGAT GGTGATGATG ACCGGTACGC 
AGCTCGCCGG CGAAATTTGA G TTACCACTA CCACTACTAC TGGC C ATGC G 
" 3 HHHHHHGTRT 
51 GTAGAATCGA GACCGAGGAG AGGGTTAGGG ATAGGCTTAC C G AATTC GAG 
CATCTTAGCT CTGGCTCCTC TCCCAATCCC TATCCGAATG GCTTAAGCTC 
-3 ■ T S D L GL L PNP I PKG FEL 
101 CTGGGTCACC ATGCCGCTGG CTTCGGAGTG AGTCTTTATG GCCTGGAAGA 
GACCCAGTGG TACGGCGACC GAAGCCTCAC TCAGAAATAC C GG ACCT TC T 
"3 QTVMGSAESHTKIAQFI- 
151 TCCACTCCCG GAAGTCACTG ACTTTGGTGT AGACGCCTGG CTTCTGGGCC 
AGGTGAGGGC C TTC AGTGAC TGAAACCACA TCTGC GG AC C GAAGACCCGG 
-3 • I W E R FDS V KT Y VG P KQAL 
2 01 AGGGCACAGC CAGTGCCCCA ACTCACAATG CCACACAGCC GCCAACGTGG 
TCCCGTGTCG GTCACGGGGT TGAGTGTTAC GGTGTGTCGG CGGTTGCACC 
-3 -LA C G TGW S V I G CL R WRP 

2 51 CGTCCGAGAG ATGCTGTCCT CACACACAAA GGGACCACCG CTGTCGCCCT 

pCAGGCTCTC TACGACAGGA GTGTGTGTTT C C CTGGTGGC GACAGCGGGA 
" 3 TRS I SD ECV F PGG S D G Q • 

3 01 GGCAGGCATC AATGCCACCC TCGGGGTAGC CAGCACAGAA CATCTTGGGC 

CCGTCCGTAG TTACGGTGGG AGCCCCATCG GTCGTGTCTT G T AG AAC C C G 
-3 • Q C A D !GG E PY G AC F MKPK 
351 TTGATCTGGT TTC CAT AG AA ATCAGCGCCA TTGCAGACAT CATTGCTGAT 
AAC TAG AC C A AAGGTATCTT TAGTCGCGGT AACGTCTGTA GT AAC G AC T A 
-3-KIQ N GY F DAG N CV D NS I 
401 T ATGGGG AC T CGAGCCTCCT GGAGTACCCC GGCCTGTTGG CCATAGTACT 
AT AC C C C TG A GCTCGGAGGA CCTCATGGGG CCGGACAACC GGTATCATGA 
-3 1PVRAEQLVGAQQGYYQ- 
451 GCGTGTTGCC CCAGCCCGTC ACGGTACAGA TCTTGCCATC CACCAGGGCC 
CGCACAACGG pGTCGGGCAG TGCCATGTCT AGAACGGTAG GTGGTCC C GG 
-3 * Q T N GWGT V TC I KG D VLAQ 
501 TGGC CGGC AG CTGGGAGGCA CACAGGCTGG ATGTATTCTG TGAGGGGCAG 
ACCGGCCGTC GACCCTCCGT GTGTCCGACC T AC AT AAGAC A : CTCCCCGTC 
-3 • Q G A A PL C VPQ I YET L P L 
551 GGG AC TGG AG AGGTGG AC C A GGGCAATATC GTTGCTGTTC TCCTCGCTGT 
CCCTGACCTC TCCACCTGGT CCCGTTATAG CAACGACAAG AGG AGCG AC A 
-3 PSSLHVLAIDNSNEESN- 
601 TGGGGTCCCG AAAGGGAAGA TAGCCCCCGT GGT AG AC C AC AGCCTGCACC 
ACCCCAGGGC TTTCCCTTCT ATCGGGGGCA CCATCTGGTG TCGGACGTGG 
-3 NPD R FPL Y GG H YV V AQVG 
651 CCCAGCTGCA GACCGTGGGG AGAGGCCTGG GCCACGGCAC CGGCAAACAC 
GGGTCGACGT CTGGCACCCC TCTCCGGACC CGGTGCCGTG GCCGTTT GTG 
-3 GLQ L GH P SAQA VAG AFV 
701 TCGCCATCGG G AC AGG AC C C GGTTCCGCTC CGGGAAGCAG TGGGCGGCTG 
AGCGGTAGCC CTGTCCTGGG CCAAGGCGAG GCCCTTCGTC ACCCGCCGAC 
-3 RWRSLVRNREPFCHAAT- 
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7 51 TCAGCACCCA GTCCCCGGAG AGCAGGGATC CCCCACAGAG GTGTGCTCCA 

AGTCGTGGGT CAGGGGCCTC TCGTCCCTAG GGGGTGTCTC CACACGAGGT 

-3 TLVW DGS L LS G GC L HAGD 

801 TCATAGCGAA GGCTGACTTG CCACGGCCAC CGGCCCAAGC TGGTGTCCCG 

AGTATCGCTT CCGACTGAAC GGTGCCGGTG GCCGGGTTCG AC C AC AGGG C 

-3 DYR L SV QWPW R GL S TDR 

851 GCCTCCCACG ATCTTGTCGT CGTCGTCCAC GGGCAGCTTC CTGCGGCCAC 

CGGAGGGTGC TAGAACAGCA GCAGCAGGTG CCCGTCGAAG GACGCCGGTG 
1 ■ — — *~ ■ * « » * ■ ■ ■ • «■- » , * * k i 

-3 GGV1KDDDDVPLKRRGC- 
901 AGTCTTGGTC CGGAGCGTCA CCAGTGGAAC CTGGAACCCA GAGCAGCAGT 

TCAGAACCAG GCCTCGCAGT GGTCACCTTG G AC C TTGGGT CTCGTCGTCA 
-3 CDQ D PAD G TS G PVW LLLV 
9 51 ACCCATAGCA GGAGTGTGTC TGTCTCCATG GTGGCGATCT GGTACCCAGC 

TGGGTATCGT CCTCACACAG ACAGAGGTA C CACCGCTAGA CCATGGGTCG 
-3 • V W L L LT D TEM 
1001 TTCTAGAGAT CTGACGGTTC ACTAAACGAG CTCTGCTTAT ATAGACCTCC 

AAGATCTCTA GACTGCCAAG TGATTTGCTC GAGACGAATA TATCTGGAGG 
1051 CACCGTACAC GCCTACCGCC CATTTGCGTC AACGGGGCGG GGTTATTACG 

GTGGCATGTG CGGATGGCGG GTAAACGCAG TTGCCCCGCC CCAATAATGC 
1101 ACATTTTGGA AAGTCCCGTT GATTTTGGTG CCAAAACAAA CTCCCATTGA 

TGTAAAACCT TTCAGGGCAA CTAAAACCAC GGTTTTGTTT GAGGGTAACT 
1151 CGTCAATGGG GTGGAGACTT GGAAATCCCC GTGAGTCAAA CCGCTATCCA 

GCAGTTACCC CACCTCTGAA CCTTTAGGGG CACTCAGTTT GGCGATAGGT 
1201 CGCCCATTGG TGTACTGCCA AAACCGCATC ACCATGGTAA TAGCGATGAC 

GCGGGTAACC ACATGACGGT TTTGGC GT AG TGGTAC C ATT ATCGCTACTG 
1251 TAATACGTAG ATGTACTGCC AAGTAGGAAA GTCCCGTAAG GTCATGTACT 

ATTATGCATC TACATGACGG TTCATCCTTT CAGGGCATTC CAGTACATGA 
1301 GGGCATAATG CCAGGCGGGC CATTTACCGT CATTGACGTC AATAGGGGGC 

CCCGTATTAC GGTCCGCCCG GTAAATGGCA GTAACTGCAG TTATCCCCCG 
1351 GGACTTGGCA TATGATACAC TTGATGTACT GCCAAGTGGG CAGTTTACCG 

CCTGAACCGT AT AC T ATGTG AACTACATGA CGGTTCACCC GTC AAATGGC 
1401 TAAATACTCC ACCCATTGAC GTCAATGGAA AGTCCCTATT GGCGTTACTA 

ATTTATGAGG TGGGTAACTG CAGTTACCTT TCAGGGATAA CCGCAATGAT 
1451 TGGGAACATA CGTCATTATT GACGTCAATG GGCGGGGGTC GTTGGGCGGT 

ACCCTTGTAT GCAGTAATAA CTGCAGTTAC CCGCCCCCAG CAACCCGCCA 
1501 CAGCCAGGCG GGCCATTTAC C GT AAGTT AT GTAACGCGGA ACTCCATATA 

GTCGGTCCGC CCGGTAAATG GCATTCAATA CATTGCGCCT TGAGGTATAT 
1551 TGGGCTATGA ACTAATGACC CCGTAATTGA TT AC T ATT AA TAACTAGTCA 

ACCCGATACT TGATTACTGG GGCATTAACT AATGATAATT ATTGATCAGT 
1601 ATAATCAATG TCAACATGGC GGTCATATTG GACATGAGCC AATATAAATG 

TATTAGTTAC AGTTGTACCG CCAGTATAAC CTGTACTCGG TTATATTTAC 
1651 TACATATTAT GATATAGATA CAACGTATGC AATGGCCAAT AGCCAATATT 

ATGTATAATA CTATATCTAT GTTGCATACG TT AC CGGTT A TCGGTTATAA 
17 01 GATTTATGCT ATATAACCAA TGACTAATAT GGCTAATTGC CAATATTGAT 

CTAAATACGA TATATTGGTT ACTGATTATA CCGATTAACG GTTATAACTA 
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17 51 TC AATGT AT A GATCTTCCAT ACCTACCAGT TCTGCGCCTG CAGCAATGCA 

AGTTACATAT C T AG AAGGT A TGGATGGTCA AGACGCGGAC GTCGTTACGT 
1801 ACAACGTTGC CCGGATCTGC GATGATAAGC TGTCAAACAT GAGAATTGGT 

TGTTGCAACG GGCCTAGACG CTACTATTCG ACAGTTTGTA CTCTTAACCA 
1851 CGACTAGCTT GGCACGCCAG AAATCCGCGC GGTGGTTTTT GGGGGTCGGG 

GCTGATCGAA CCGTGCGGTC TTTAGGCGCG CCACCAAAAA CCCCCAGCCC 
1901 GGTGTTTGGC AGCCACAGAC GCCCGGTGTT CGTGTCGCGC CAGTACATGC 

CCACAAACCG TCGGTGTCTG CGGGCCACAA GCACAGCGCG GTCATGTACG 
1951 GGTCCATGCC CAGGCCATCC AAAAACCATG GGTCTGTCTG CTCAGTCCAG 

CCAGGTACGG GTCCGGTAGG TTTTTGGTAC CC AG AC AG AC GAGTCAGGTC 
2001 TCGTGGACCA GACCCCACGC AACGCCCAAA ATAATAACCC CCACGAACCA 

AGCACCTGGT CTGGGGTGCG TTGCGGGTTT TATTATTGGG GGTGCTTGGT 
2051 TAAACCATTC CCCATGGGGG ACCCCGTCCC TAACCCACGG GGCCAGTGGC 

ATTTGGTAAG GGGTACCCCC TGGGGCAGGG ATTGGGTGCC CCGGTCACCG 
2101 TATGGCAGGG CCTGCCGCCC CGACGTTGGC TGCGAGCCCT GGGCCTTCAC 

ATACCGTCCC GGACGGCGGG GCTGCAACCG ACGCTCGGGA CCCGGAAGTG 
2151 CCGAACTTGG GGGGTGGGGT GGGGAAAAGG AAGAAACGCG GGCGTATTGG 

GGCTTGAACC CCCCACCCCA CCCCTTTTCC TTCTTTGCGC CCGCATAACC 
2201 CCCCAATGGG GTCTCGGTGG GGTATCGACA GAGTGCCAGC CCTGGGACCG 

GGGGTTACCC CAGAGCCACC CCATAGCTGT CTCACGGTCG GGACCCTGGC 
2251 AACCCCGCGT TTATGAACAA ACGACCCAAC ACCCGTGCGT TTTATTCTGT 

TTGGGGCGCA AATACTTGTT TGCTGGGTTG TGGGCACGCA AAATAAGACA 
2301 CTTTTTATTG CCGTCATAGC GCGGGTTCCT TCCGGTATTG TCTCCTTCCG 

GAAAAATAAC GGCAGTATCG CGCCCAAGGA AGGCCATAAC AGAGGAAGGC 
2351 TGTTTCAGTT AGCCTCCCCC ATCTCCCCTA TTCCTTTGCC C TC GG AC GAG 

ACAAAGTCAA TCGGAGGGGG TAGAGGGGAT AAGGAAACGG GAGCCTGCTC 
2401 TGCTGGGGCG TCGGTTTCCA CTATCGGCGA GTACTTCTAC ACAGCCATCG 

ACGACCCCGC AGCCAAAGGT GATAGCCGCT CATGAAGATG TGTCGGTAGC 
2451 GTCCAGACGG CCGCGCTTCT GCGGGCGATT TGTGTACGCC CGACAGTCCC 

CAGGTCTGCC GGCGCGAAGA CGCCCGCTAA ACACATGCGG GCTGTCAGGG 
2 501 GGCTCCGGAT CGGACGATTG CGTCGCATCG ACCCTGCGCC CAAGCTGCAT 

CCGAGGCCTA GCCTGCTAAC GCAGCGTAGC TGGGACGCGG GTTCGACGTA 
2 551 CATCGAAATT GCCGTCAACC AAGCTCTGAT AGAGTTGGTC AAGACCAATG 

GTAGCTTTAA CGGCAGTTGG TTCGAGACTA TCTCAACCAG TTCTGGTTAC 
2 601 CGGAGCATAT ACGCCCGGAG CCGCGGCGAT CCTGCAAGCT CCGGATGCCT 

GCCTCGTATA TGCGGGCCTC GGCGCCGCTA GGACGTTCGA GGCCTACGGA 
2 651 CCGCTCGAAG TAGCGCGTCT GCTGCTCCAT ACAAGCCAAC CACGGCCTCC 

GGCGAGCTTC ATCGCGCAGA CGACGAGGTA TGTTC GGTTG GTGCCGGAGG 
2701 AGAAGAAGAT GTTGGCGACC TCGTATTGGG AATCCCCGAA CATCGCCTCG 

TCTTCTTCTA CAACCGCTGG AGCATAACCC TTAGGGGCTT GTAGCGGAGC 
2751 CTCCAGTCAA TGACCGCTGT TATGCGGCCA TTGTCCGTCA GGACATTGTT 

GAGGTCAGTT ACTGGCGACA ATACGCCGGT AACAGGCAGT CCTGTAACAA 
2 801 GG AGC CG AAA TCCGCGTGCA CGAGGTGCCG GACTTCGGGG CAGTCCTCGG 

CCTCGGCTTT AGGCGCACGT GCTCCACGGC CTGAAGCCCC GTCAGGAGCC 
2851 CCCAAAGCAT CAGCTCATCG AGAGCCTGCG CGACGGACGC ACTGACGGTG 

GGGTTTCGTA GTCGAGTAGC TCTCGGACGC GCTGCCTGCG TGACTGCCAC 
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2901 TCGTCCATCA CAGTTTGCCA GTGATACACA TGGGGATCAG CAATCGCGCA 

AGCAGGTAGT GTCAAACGGT CACTATGTGT ACCCCTAGTC GTTAGCGCGT 
2951 TATGAAATCA CGCCATGTAG TGTATTGACC GATTCCTTGC GGTCCGAATG 

ATACTTTAGT GCGGTACATC ACATAACTGG CTAAGGAACG CCAGGCTTAC 
3001 GGCCGAACCC GCTCGTCTGG CTAAGATCGG CCGCAGCGAT CGCATCCATG 

CCGGCTTGGG CGAGCAGACC GATTCTAGCC GGCGTCGCTA GCGTAGGTAC 
3 051 GCCTCCGCGA CCGGCTGCAG AACAGCGGGC AGTTCGGTTT CAGGCAGGTC 

CGGAGGCGCT GGCCGACGTC TTGTCGCCCG TCAAGCCAAA GTCCGTCCAG 
3101 TTGCAACGTG ACACCCTGTG CACGGCGGGA GATGCAATAG GTCAGGCTCT 

AACGTTGCAC TGTGGGACAC GTGCCGCCCT CTACGTTATC CAGTCCGAGA 
3151 CGCTGAATTC CCCAATGTCA AGCACTTCCG GAATCGGGAG CGCGGCCGAT 

GCGACTTAAG GGGTTACAGT TCGTGAAGGC CTTAGCCCTC GCGCCGGCTA 
3201 GCAAAGTGCC GATAAACATA ACGATCTTTG TAGAAACCAT CGGCGCAGCT 

CGTTTCACGG CTATTTGTAT TGCTAGAAAC ATCTTTGGTA GCCGCGTCGA 
3251 ATTTACCCGC AGGACATATC CACGCCCTCC TACATCGAAG CTGAAAGCAC 

TAAATGGGCG TCCTGTATAG GTGCGGGAGG ATGTAGCTTC GACTTTCGTG 
3301 GAGATTCTTC GCCCTCCGAG AGCTGCATCA GGTC GG AG AC GCTGTCGAAC 

CTCTAAGAAG CGGGAGGCTC TCGACGTAGT CCAGCCTCTG CGACAGCTTG 
3351 TTTTCGATCA GAAACTTCTC G AC AG AC GTC GCGGTGAGTT CAGGCTTTTT 

AAAAGCTAGT CTTTGAAGAG CTGTCTGCAG CGCCACTCAA GTCCGAAAAA 
3401 CATATCTCAT TGCCCGGGAT CTGCGGCACG CTGTTGACGC TGTTAAGCGG 

GTATAGAGTA ACGGGCCCTA GACGCCGTGC GACAACTGCG ACAATTCGCC 
3451 GTCGCTGCAG GGTCGCTCGG TGTTCGAGGC CACACGCGTC ACCTTAATAT 

CAGCGACGTC CCAGCGAGCC ACAAGCTCCG GTGTGCGCAG TGGAATTATA 
3501 GCGAAGTGGA CCTGGGACCG CGCCGCCCCG ACTGCATCTG CGTGTTCGAA 

CGCTTCACCT GGACCCTGGC GCGGCGGGGC TG AC GT AG AC GCACAAGCTT 
3551 TTCGCCAATG ACAAGACGCT GGGCGGGGTT TGTGTCATCA T AG AAC T AAA 

AAGCGGTTAC TGTTCTGCGA CCCGCCCCAA AC AC AGT AGT ATCTTGATTT 
3 601 GACATGCAAA TATATTTCTT CCGGGGACAC CGCCAGCAAA CGCGAGCAAC 

CTGTACGTTT ATATAAAGAA GGCCCCTGTG GCGGTCGTTT GCGCTCGTTG 
3 651 GGGCCACGGG GATGAAGCAG GGCATGGCGG CCGACGCGCT GGGCTACGTC 

CCCGGTGCCC CTACTTC GTC CCGTACCGCC GGCTGCGCGA CCCGATGCAG 
3701 TTGCTGGCGT TCGCGACGCG AGGCTGGATG GCCTTCCCCA TTATGATTCT 

AACGACCGCA AGCGCTGCGC TCCGACCTAC CGGAAGGGGT AATACTAAGA 
3751 TCTCGCTTCC GGCGGCATCG GGATGCCCGC GTTGCAGGCC ATGCTGTCCA 

AGAGCGAAGG CCGCCGTAGC CCTACGGGCG CAACGTCCGG TACGACAGGT 
3801 GGC AGGT AG A TGACGACCAT CAGGGACAGC TTCAAGGATC GCTCGCGGCT 

CCGTCCATCT ACTGCTGGTA GTCCCTGTCG AAGTTCCTAG CGAGCGCCGA 
3 851 CTTACCAGCC TAACTTCGAT CACTGGACCG CTGATCGTCA CGGCGATTTA 

GAATGGTCGG ATTGAAGCTA GTGACCTGGC GACTAGCAGT GCCGCTAAAT 
3901 TGCCGCCTCG GCGAGCACAT GGAACGGGTT GGCATGGATT GTAGGCGCCG 

ACGGCGGAGC CGCTCGTGTA CCTTGCCCAA CCGTACCTAA CATCCGCGGC 
3951 CCCTATACCT TGTCTGCCTC CCCGCGTTGC GTCGCGGTGC ATGGAGCCGG 

GGGATATGGA ACAGACGGAG GGGCGCAACG CAGCGCCACG TACCTCGGCC 
4001 GCCACCTCGA CCTGAATGGA AGCCGGCGGC ACCTCGCTAA CGGATTCACC 

CGGTGGAGCT GGACTTACCT TCGGCCGCCG TGGAGCGATT GCCTAAGTGG 



4- 
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4 051 ACTCCAAGAA TTGGAGCCAA TCAATTCTTG CGGAGAACTG TGAATGCGCA 

TGAGGTTCTT AACCTCGGTT AGTTAAGAAC GCCTCTTGAC ACTTACGCGT 
4101 AACCAACCCT TGGCAGAACA TATCCATCGC GTCCGCCATC TCCAGCAGCC 

TTGGTTGGGA ACCGTCTTGT ATAGGTAGCG CAGGCGGTAG AGGTCGTCGG 
4151 GCACGCGGCG CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GCGTTGCTGG 

CGTGCGCCGC GTCGTTTTCC GGTCCTTGGC ATTTTTCCGG CGCAACGACC 
4201 CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AGCATCACAA AAATCGACGC 

GCAAAAAGGT ATCCGAGGCG GGGGGACTGC TCGTAGTGTT TTTAGCTGCG 
4251 TCAAGTCAGA GGTGGCGAAA CCCGACAGGA C T AT AAAG AT ACCAGGCGTT 

AGTTCAGTCT CCACCGCTTT GGGCTGTCCT GATATTTCTA TGGTCCGCAA 
4301 TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA 

AGGGGGACCT TCGAGGGAGC ACGCGAGAGG ACAAGGCTGG GACGGCGAAT 
4351 CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT 

GGCCTATGGA CAGGCGGAAA GAGGGAAGCC CTTCGCACCG CGAAAGAGTA 
4401 AGCTCACGCT GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT 

TCGAGTGCGA CATCCATAGA GTCAAGCCAC ATCCAGCAAG CGAGGTTCGA 
4451 GGGCTGTGTG CACGAACCCC CCGTTCAGCC CGACCGCTGC GCCTTATCCG 

CCCGACACAC GTGCTTGGGG GGCAAGTCGG GCTGGCGACG CGGAATAGGC 
4501 GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT ATCGCCACTG 

CATTGATAGC AGAACTCAGG TTGGGCCATT CTGTGCTGAA TAGCGGTGAC 
4551 GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC 

CGTCGTCGGT GACCATTGTC CTAATCGTCT CGCTCCATAC ATCCGCCACG 
4601 T AC AG AGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 

ATGTCTCAAG AACTTCACCA CCGGATTGAT GCCGATGTGA TCTTCCTGTC 
4651 TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT 

ATAAACCATA G AC GC GAG AC GACTTCGGTC AATGGAAGCC TTTTTCTCAA 
4701 GGTAGCTCTT GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT 

CCATCGAGAA CTAGGCCGTT TGTTTGGTGG CGACCATCGC CACCAAAAAA 
4751 TGTTTGCAAG CAGCAGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC 

ACAAACGTTC GTCGTCTAAT GCGCGTCTTT TTTTCCTAGA GTTC TTC TAG 
4801 CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA AAACTCACGT 

GAAACTAGAA AAGATGCCCC AGACTGCGAG TCACCTTGCT TTTGAGTGCA 
4851 TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT 

ATTC C C T AAA AC C AGTACTC TAATAGTTTT TCCTAGAAGT GGATCTAGGA 
4901 TTTAAATTAA AAATGAAGTT TTAAATCAAT C T AAAGT AT A TATGAGTAAA 

AAATTTAATT TTTACTTCAA AATTTAGTTA GATTTCATAT ATACTCATTT 
4951 CTTGGTC TG A CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG 

GAACCAGACT GTCAATGGTT ACGAATTAGT CACTCCGTGG ATAGAGTCGC 
5001 ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG TC GTGT AG AT 

TAGACAGATA AAGCAAGTAG GTATCAACGG ACTGAGGGGC AGCACATCTA 
5051 AACTACGATA CGGGAGGGCT TAC C ATCTGG CCCCAGTGCT GCAATGATAC 

TTGATGCTAT GCCCTCCCGA ATGGT AG AC C GGGGTCACGA CGTTACTATG 
5101 CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT AAACCAGCCA 

GCGCTCTGGG TGCGAGTGGC CGAGGTCTAA AT AGTC GTT A TTTGGTCGGT 
5151 GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT 

CGGCCTTCCC GGCTCGCGTC TTCACCAGGA CGTTGAAATA GGCGGAGGTA 
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5201 CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 

GGTCAGATAA TTAACAACGG CCCTTCGATC TCATTCATCA AGCGGTCAAT 
52 51 ATAGTTTGCG CAACGTTGTT GCCATTGCTG CAGGCATCGT GGTGTCACGC 

TATCAAACGC GTTGCAACAA CGGTAACGAC GTCCGTAGCA CCACAGTGCG 
5301 TCGTCGTTTG GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG 

AGCAGCAAAC CATACCGAAG TAAGTCGAGG CCAAGGGTTG CTAGTTCCGC 
5351 AGTTACATGA TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC 

TCAATGTACT AGGGGGTACA ACACGTTTTT TCGCCAATCG AGGAAGCCAG 
5401 CTCCGATCGT TGTC AG AAGT AAGTTGGCCG CAGTGTTATC ACTCATGGTT 

GAGGCTAGCA ACAGTCTTCA TTCAACCGGC GTCACAATAG TGAGTACCAA 
5451 ATGGCAGCAC TGCATAATTC TC TT AC TGTC ATGCCATCCG TAAGATGCTT 

TACCGTCGTG ACGTATTAAG AG AATG AC AG TACGGTAGGC ATTCTACGAA 
5501 TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 

AAGACACTGA CCACTCATGA GTTGGTTCAG TAAGACTCTT ATCACATACG 
55 51 GGCGACCGAG TTGCTCTTGC CCGGCGTCAA CACGGGATAA TACCGCGCCA 

CCGCTGGCTC AACGAGAACG GGCCGCAGTT GTGCCCTATT ATGGCGCGGT 
5601 CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG 

GTATCGTCTT GAAATTTTCA CGAGTAGTAA CCTTTTGCAA GAAGCCCCGC 
5651 AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA 

TTTTGAGAGT TCCTAGAATG GCGACAACTC TAGGTCAAGC TACATTGGGT 
5701 CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT 

GAGCACGTGG GTTG AC TAG A AGTCGTAGAA AATGAAAGTG GTCGCAAAGA 
5751 GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGGC 

CCCACTCGTT TTTGTCCTTC CGTTTTACGG CGTTTTTTCC CTTATTCCCG 
5801 GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 

CTGTGCCTTT ACAACTTATG AGTATGAGAA GGAAAAAGTT ATAATAACTT 
5851 GCATTTATCA GGGTTATTGT CTCATGAGCG GAT AC AT ATT TGAATGTATT 

CGTAAATAGT CCCAATAACA GAGTACTCGC CTATGTATAA AC TT AC AT AA 
5901 TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC 

ATC TTTTTAT TTGTTTATCC CCAAGGCGCG TGTAAAGGGG CTTTTCACGG 
5951 ACCTGACGTC TAAGAAAC C A TTATTATCAT G AC ATT AAC C TATAAAAATA 

TGGACTGCAG ATTCTTTGGT AATAATAGTA CTGTAATTGG ATATTTTTAT 
6001 GGCGTATCAC GAGGCCCTTT CGTCTTCAAG AATTCTCATG TTTGACAGCT 

CCGCATAGTG CTCCGGGAAA GCAGAAGTTC TTAAGAGTAC AAAC TGTC G A 
6051 TATCATCGAT AAGCTGATCC TCACAGGCCG CACCCAGCTT TTCTTCCGTT 

ATAGTAGCTA TTCGACTAGG AGTGTCCGGC GTGGGTCGAA AAGAAGGCAA 
6101 GCCCCAGTAG CATCTCTGTC TGGTG AC CTT GAAGAGGAAG AGGAGGGGTC 

CGGGGTCATC GTAGAGACAG ACCACTGGAA CTTCTCCTTC TCCTCCCCAG 
6151 CCGAGAATCC CCATCCCTAC CGTCCAGCAA AAAGGGGGAC GAGGAATTTG 

GGCTCTTAGG GGTAGGGATG GCAGGTCGTT TTTCCCCCTG CTCCTTAAAC 
6201 AGGCCTGGCT TGAGGCTCAG GACGCAAATC TTGAGGATGT TCAGCGGGAG 

TCCGGACCGA ACTCCGAGTC CTGCGTTTAG AACTCCTACA AGTCGCCCTC 
6251 TTTTCCGGGC TGCGAGTAAT TGGTGATGAG GACGAGGATG GTTCGGAGGA 

AAAAGGCCCG ACGCTCATTA ACCACTACTC CTGCTCCTAC CAAGCCTCCT 
6301 TGGGGAATTT TCAGACCTGG ATC TGTC TGA C AGCG AC CAT GAAGGGGATG 

ACCCCTTAAA AGTCTGGACC TAGACAGACT GTCGCTGGTA CTTCCCCTAC 
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63 51 AGGGTGGGGG GGCTGTTGGA 

TCCCACCCCC CCGACAACCT 
6401 CTGAGCGTCG TCTAATAAAG 

GACTCGCAGC AGATTATTTC 
6451 TGTCTGACGA GGGGCCAGGT 

ACAGACTGCT CCCCGGTCCA 
6501 GGAGACACAT CTGGACCAGA 

CCTCTGTGTA GACCTGGTCT 
6551 AGGGGGTGAT AACCATGGAC 

TCCCCCACTA TTGGTACCTG 
6601 GCGGAAGACC AGGAGCCCCG 

CGCCTTCTGG TCCTCGGGGC 
6651 GATGGTGTCC GGAGACCCCA 

CTACCACAGG CCTCTGGGGT 
6701 GACCCACGGT GGAACAGGAG 

CTGGGTGCCA CCTTGTCCTC 
67 51 GAGGGGCAGG AGCAGGAGGA 

CTCCCCGTCC TCGTCCTCCT 
6801 GCAGGAGGGG CAGGAGGGGC 

CGTCCTCCCC GTCCTCCCCG 
6851 AGGGGCAGGA GGGGCAGGAG 

TCCCCGTCCT CCCCGTCCTC 
6901 GAGGAGGGGC AGGAGGGGCA 

CTCCTCCCCG TCCTCCCCGT 
6951 GGGGCAGGAG CAGGAGGAGG 

CCCCGTCCTC GTCCTCCTCC 
7001 AGGAGCAGGA GGAGGGGCAG 

TCCTCGTCCT CCTCCCCGTC 
7051 GGGCAGGAGC AGGAGGGGCA 

CCCGTCCTCG TCCTCCCCGT 
7101 GCAGGAGCAG GAGGAGGGGC 

CGTCCTCGTC CTCCTCCCCG 
7151 GGCAGGAGCA GGAGGGGCAG 

CCGTCCTCGT CCTCCCCGTC 
7201 GAGGGGCAGG AGCAGGAGGG 

CTCCCCGTCC TCGTCCTCCC 
7251 GGGGCAGGAG CAGGAGGAGG 

CCCCGTCCTC GTCCTCCTCC 
7301 AGGAGGGGCA GGAGCAGGAG 

TCCTCCCCGT CCTCGTCCTC 
7 351 GAGGGGCAGG AGCAGGAGGG 

CTCCCCGTCC TCGTCCTCCC 
7 401 GGAGCAGGAG GGGCAGGAGC 

CCTCGTCCTC CCCGTCCTCG 
7451 TGGAGGCCGG GGTCGAGGAG 

ACCTCCGGCC CCAGCTCCTC 



GGGGGCAGGA GTCTGCACTC CCTGTATTCA 
CCCCCGTCCT CAGACGTGAG GGACATAAGT 
ATGTCTATTG ATCTCTTTTA GTGTGAATCA 
TACAGATAAC TAGAGAAAAT CACACTTAGT 
ACAGGACCTG GAAATGGCCT AGGAGAGAAG 
TGTCCTGGAC CTTTACCGGA TCCTCTCTTC 
AGGCTCCGGC GGCAGTGGAC CTCAAAGAAG 
TCCGAGGCCG CCGTCACCTG GAGTTTCTTC 
GAGGACGGGG AAGAGGACGA GGACGAGGAG 
CTCCTGCCCC TTCTCCTGCT CCTGCTCCTC 
GGCGGCTCAG GATCAGGGCC AAGACATAGA 
CCGCCGAGTC CTAGTCCCGG TTCTGTATCT 
AAAACGTCCA AGTTGCATTG GCTGCAAAGG 
TTTTGCAGGT TCAACGTAAC CGACGTTTCC 
CAGGAGCAGG AGCGGGAGGG GCAGGAGCAG 
GTCCTCGTCC TCGCCCTCCC CGTCCTCGTC 
GGGGCAGGAG CAGGAGGAGG GGCAGGAGGG 
CCCCGTCCTC GTCCTCCTCC CCGTCCTCCC 
AGGAGCAGGA GGAGGGGCAG GAGCAGGAGG 
TCCTCGTCCT CCTCCCCGTC CTCGTCCTCC 
GGGCAGGAGC AGGAGGAGGG GCAGGAGCAG 
CCCGTCCTCG TCCTCCTCCC CGTCCTCGTC 
GGAGCAGGAG GAGGGGCAGG AGGGGCAGGA 
CCTCGTCCTC CTCCCCGTCC TCCCCGTCCT 
GGCAGGAGCA GGAGGAGGGG CAGGAGGGGC 
CCGTCCTCGT CCTCCTCCCC GTCCTCCCCG 
GAGGGGCAGG AGGGGCAGGA GCAGGAGGAG 
CTCCCCGTCC TCCCCGTCCT CGTCCTCCTC 
GGAGGGGCAG GAGGGGCAGG AGCAGGAGGG 
CCTCCCCGTC CTCCCCGTCC TCGTCCTCCC 
AGGAGGGGCA GGAGGGGCAG GAGCAGGAGG 
TCCTCCCCGT CCTCCCCGTC CTCGTCCTCC 
GAGCAGGAGG GGCAGGAGCA GGAGGGGCAG 
CTCGTCCTCC CCGTCCTCGT CCTCCCCGTC 
GCAGGAGGGG CAGGAGCAGG AGGGGCAGGA 
CGTCCTCCCC GTCCTCGTCC TCCCCGTCCT 
GGCAGGAGGG GCAGGAGCAG GAGGAGGGGC 
CCGTCCTCCC CGTCCTCGTC CTCCTCCCCG 
GGGCAGGAGG GGCAGGAGCA GGAGGGGCAG 
CCCGTCCTCC CCGTCCTCGT CCTCCCCGTC 
GCAGGAGGGG CAGGAGCAGG AGGAGGGGCA 
CGTCCTCCCC GTCCTCGTCC TCCTCCCCGT 
AGGAGGTGGA GGCCGGGGTC GAGGAGGCAG 
TCCTCCACCT CCGGCCCCAG CTCCTCCGTC 
GTAGTGGAGG CCGGGGTCGA GGAGGTAGTG 
CATCACCTCC GGCCCCAGCT CCTCCATCAC 
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7501 GAGGCCGCCG GGGTAGAGGA C GTGAAAG AG CCAGGGGGGG AAGTCGTGAA 

CTCCGGCGGC CCCATCTCCT GCACTTTCTC GGTCCCCCCC TTCAGCACTT 
7551 AGAGCCAGGG GGAGAGGTCG TGGACGTGGA GAAAAGAGGC CCAGGAGTCC 

TCTCGGTCCC CCTCTCCAGC ACCTGCACCT CTTTTCTCCG GGTCCTCAGG 
7 601 CAGTAGTCAG TCATCATCAT CCGGGTCTCC ACCGCGCAGG CCCCCTCCAG 

GTCATCAGTC AGTAGTAGTA GGCCCAGAGG TGGCGCGTCC GGGGGAGGTC 
7651 GTAGAAGGCC ATTTTTCCAC CCTGTAGGGG AAGCCGATTA TTTTGAATAC 

CATCTTCCGG TAAAAAGGTG GGACATCCCC TTCGGCTAAT AAAACTTATG 
7701 CACCAAGAAG GTGGCCCAGA TGGTGAGCCT GACGTGCCCC CGGGAGCGAT 

GTGGTTCTTC CACCGGGTCT ACCACTCGGA CTGCACGGGG GCCCTCGCTA 
7751 AGAGCAGGGC CCCGCAGATG ACCCAGGAGA AGGCCCAAGC ACTGGACCCC 

TCTCGTCCCG GGGCGTCTAC TGGGTCCTCT TCCGGGTTCG TGACCTGGGG 
7801 GGGGTCAGGG TGATGGAGGC AGGCGCAAAA AAGGAGGGTG GTTTGGAAAG 

CCCCAGTCCC ACTACCTCCG TCCGCGTTTT TTCCTCCCAC CAAACCTTTC 
7851 CATCGTGGTC AAGGAGGTTC CAACCCGAAA TTTGAGAACA TTGCAGAAGG 

GTAGCACCAG TTCCTCCAAG GTTGGGCTTT AAACTCTTGT AACGTCTTCC 
7901 TTTAAGAGCT CTCCTGGCTA GGAGTCACGT AGAAAGGACT ACCGACGAAG 

AAATTCTCGA GAGGACCGAT CCTCAGTGCA TCTTTCCTGA TGGCTGCTTC 
7951 GAACTTGGGT CGCCGGTGTG TTCGTATATG GAGGTAGTAA GACCTCCCTT 

CTTGAACCCA GCGGCCACAC AAGCATATAC CTC CATC ATT CTGGAGGGAA 
8001 TACAACCTAA GGCGAGGAAC TGCCCTTGCT ATTCCACAAT GTCGTCTTAC 

ATGTTGGATT CCGCTCCTTG ACGGGAACGA TAAGGTGTTA CAGCAGAATG 
8051 ACCATTGAGT CGTCTCCCCT TTGGAATGGC CCCTGGACCC GGCCCACAAC 

TGGTAACTCA GCAGAGGGGA AACCTTACCG GGGACCTGGG CCGGGTGTTG 
8101 CTGGCCCGCT AAGGGAGTCC ATTGTC TGTT ATTTCATGGT CTTTTTACAA 

GACCGGGCGA TTCCCTCAGG TAACAGACAA TAAAGTAC C A GAAAAATGTT 
8151 ACTCATATAT TTGCTGAGGT TTTGAAGGAT GCGATTAAGG ACCTTGTTAT 

TGAGTATATA AACGACTCCA AAACTTC CTA CGCTAATTCC TGGAACAATA 
8201 GACAAAGCCC GCTCCTACCT GCAATATCAG GGTGACTGTG TGCAGCTTTG 

CTGTTTCGGG CGAGGATGGA CGTTATAGTC CCACTGACAC ACGTCGAAAC 
8251 ACGATGGAGT AGATTTGCCT CCCTGGTTTC CACCTATGGT GGAAGGGGCT 

TGCTACCTCA TCTAAACGGA GGGACCAAAG GTGGATACCA CCTTCCCCGA 
8301 GCCGCGGAGG GTGATGACGG AG ATG AC GG A GATGAAGGAG GTGATGGAGA 

CGGCGCCTCC CACTACTGCC TCTACTGCCT CTACTTCCTC CACTACCTCT 
8351 TGAGGGTGAG GAAGGGCAGG AGTGATGTAA CTTGTTAGGA GACGCCCTCA 

ACTCCCACTC CTTCCCGTCC TCACTACATT GAACAATCCT CTGCGGGAGT 
8401 ATCGTATTAA AAGCCGTGTA TTCCCCCGCA CTAAAGAATA AATCCCCAGT 

TAGCATAATT TTCGGCACAT AAGGGGGCGT GATTTCTTAT TTAGGGGTCA 
8451 AGACATCATG CGTGCTGTTG GTGTATTTCT GGCCATCTGT CTTGTCACCA 

TCTGTAGTAC GCACGACAAC CACATAAAGA CCGGTAGACA GAACAGTGGT 
8501 TTTTCGTCCT CCCAACATGG GGCAATTGGG CATACCCATG TTGTCACGTC 

AAAAGCAGGA GGGTTGTACC CCGTTAACCC GTATGGGTAC AACAGTGCAG 
8551 ACTCAGCTCC GCGCTCAACA CCTTCTCGCG TTGGAAAACA TTAGCGACAT 

TGAGTCGAGG CGCGAGTTGT GGAAGAGCGC AAC CTTTTGT AATCGCTGTA 
8601 TTACCTGGTG AGCAATCAGA CATGCGACGG CTTTAGCCTG GCCTCCTTAA 

AATGGACCAC TCGTTAGTCT GTACGCTGCC GAAATCGGAC CGGAGGAATT 
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8651 ATTCACCTAA GAATGGGAGC AACCAGCATG CAGGAAAAGG ACAAGCAGCG 

TAAGTGGATT CTTACCCTCG TTGGTCGTAC GTCCTTTTCC TGTTCGTCGC 
8701 AAAATTCACG CCCCCTTGGG AGGTGGCGGC ATATGCAAAG GATAGCACTC 

TTTTAAGTGC GGGGGAACCC TCCACCGCCG TAT AC GTTTC CTATCGTGAG 
8751 CCACTCTACT AC TGGGT ATC ATATGCTGAC TGTATATGCA TGAGGATAGC 

GGTGAGATGA TGACC CAT AG TAT AC G AC TG AC AT AT AC GT ACTCCTATCG 
8801 ATATGCTACC CGGATACAGA TTAGGATAGC AT AT ACT AC C CAGATATAGA 

TATACGATGG GCCTATGTCT AATCCTATCG TATATGATGG GTCTATATCT 
8851 TTAGGATAGC ATATGCTACC CAGATATAGA TTAGGATAGC CTATGCTACC 

AATCCTATCG TATACGATGG GTCTATATCT AATCCTATCG GATACGATGG 
8901 C AG AT AT AAA TTAGGATAGC AT AT AC T AC C CAGATATAGA TTAGGATAGC 

GTCTATATTT AATCCTATCG TATATGATGG GTCTATATCT AATCCTATCG 
8951 ATATGCTACC CAGATATAGA TTAGGATAGC CTATGCTACC CAGATATAGA 

TATACGATGG GTCTATATCT AATCCTATCG GATACGATGG GTCTATATCT 
9001 TTAGGATAGC ATATGCTACC CAGATATAGA TTAGGATAGC ATATGCTATC 

AATCCTATCG TATACGATGG GTCTATATCT AATCCTATCG TATACGATAG 
9051 CAGATATTTG GGTAGTATAT GCTACCCAGA TATAAATTAG GATAGCATAT 

GTCTATAAAC C C ATC AT AT A CGATGGGTCT ATATTTAATC CTATCGTATA 
9101 ACTACCCTAA TCTCTATTAG GATAGCATAT GCTACCCGGA TACAGATTAG 

TGATGGGATT AGAGATAATC CTATCGTATA CGATGGGCCT ATGTCTAATC 
9151 GATAGCATAT ACTACCCAGA TATAGATTAG GATAGCATAT GCTACCCAGA 

CTATCGTATA TGATGGGTCT ATATCTAATC CTATCGTATA CGATGGGTCT 
9201 TATAGATTAG GAT AG C C TAT GCTACCCAGA TATAAATTAG GATAGCATAT 

ATATCTAATC CTATCGGATA CGATGGGTCT ATATTTAATC CTATCGTATA 
9251 ACTACCCAGA TATAGATTAG GATAGCATAT GCTACCCAGA TATAGATTAG 

TGATGGGTCT ATATCTAATC CTATCGTATA CGATGGGTCT ATATCTAATC 
9301 GATAGCCTAT GCTACCCAGA TATAGATTAG GATAGCATAT GCTATCCAGA 

CTATCGGATA CGATGGGTCT ATATCTAATC CTATCGTATA CGATAGGTCT 
9351 TATTTGGGTA GTATATGCTA CCCATGGCAA CATTAGCCCA CCGTGCTCTC 

ATAAACCCAT CATATACGAT GGGTACCGTT GTAATCGGGT GGCACGAGAG 
9401 AGCGACCTCG TGAATATGAG GACCAACAAC CCTGTGCTTG GCGCTCAGGC 

TCGCTGGAGC ACTTATACTC CTGGTTGTTG GGACACGAAC CGCGAGTCCG 
9451 GCAAGTGTGT GTAATTTGTC CTCCAGATCG CAGCAATCGC GCCCCTATCT 

CGTTCACACA C ATT AAAC AG GAGGTCTAGC GTCGTTAGCG CGGGGATAGA 
9501 TGGCCCGCCC ACCTACTTAT GCAGGTATTC CCCGGGGTGC CATTAGTGGT 

ACCGGGCGGG TGGATGAATA CGTCCATAAG GGGCCCCACG GTAATCACCA 
9551 TTTGTGGGCA AGTGGTTTGA CCGCAGTGGT TAGCGGGGTT ACAATCAGCC 

AAACACCCGT TCACCAAACT GGCGTCACCA ATCGCCCCAA TGTTAGTCGG 
9 601 AAGTTATTAC ACCCTTATTT TACAGTCCAA AACCGCAGGG CGGCGTGTGG 

TTCAATAATG TGGGAATAAA ATGTCAGGTT TTGGCGTCCC GCCGCACACC 
9 651 GGGCTGACGC GTGCCCCCAC TCCACAATTT CAAAAAAAAG AGTGGCCACT 

CCCGACTGCG CACGGGGGTG AGGTGTTAAA GTTTTTTTTC TCACCGGTGA 
9701 TGTCTTTGTT TATGGGCCCC ATTGGCGTGG AGCCCCGTTT AATTTTCGGG 

ACAGAAACAA AT AC C C GGGG TAACCGCACC TCGGGGCAAA TTAAAAGCCC 
9751 GGTGTTAGAG ACAACCAGTG GAGTCCGCTG CTGTCGGCGT CCACTCTCTT 

CCACAATCTC TGTTGGTCAC CTCAGGCGAC GACAGCCGCA GGTGAGAGAA 




9801 TCCCCTTGTT ACAAATAGAG 

AGGGGAACAA TGTTTATCTC 
9851 CTGCCTGGGA CACATCTTAA 

G AC GG AC C C T GTGTAGAATT 
9901 TGTGTTGCCC ATAGCCATAA 

ACACAACGGG TATCGGTATT 
9951 GGCTTGTCCC CACCCCATGG 

CCGAACAGGG GTGGGGTACC 
10001 TCATTCCTAC ACTAGTATTT 

AGTAAGGATG TGATCATAAA 
10051 GGTGTCATAG CACAATGCCA 

CCACAGTATC GTGTTACGGT 
10101 GGGGGCGTCA CCTGAAACCT 

CCCCCGCAGT GGACTTTGGA 
10151 GTTCACAACT CAGCAGTTAT 

CAAGTGTTGA GTCGTCAATA 
10201 GCAGGCGAAG ATTCAGGAGA 

CGTCCGCTTC TAAGTCCTCT 
10251 CTGCCCTTGT GACTAAAATG 

GACGGGAACA CTGATTTTAC 
10301 TGTAAATAAA ACCGTGACAG 

ACATTTATTT TGGCACTGTC 
10351 AGGACCCTTT TACTAACCCT 

TC C TGGG AAA ATGATTGGGA 
10401 ACATATGCTA TTGAATTAGG 

TGTATACGAT AACTTAATCC 
10451 GGAAGCATAT GCTACCCGTT 

CCTTCGTATA CGATGGGCAA 
10501 ATTGCTAATG CCCTCTTGAG 

T AAC G ATT AC GGGAGAACTC 
10551 TCTGATTGAC GTTGGTGTAG 

AGACTAACTG CAACCACATC 
10 6 01 GTACATGTCC CCCAGCATTG 

CATGTACAGG GGGTCGTAAC 
10651 AAAGGCAATG TTGTGTTGCA 

TTTCCGTTAC AACACAACGT 
10701 GAAAGCCACT CAGTGTTGGC 

CTTTCGGTGA GTCACAACCG 
10751 ACTACAGTCA GAGAACCCCT 

TGATGTCAGT CTCTTGGGGA 
10801 GGAACAGGGC CCAGTTGGCA 

CCTTGTCCCG GGTCAACCGT 
10851 GCACTGCCCC GCGAAGAAGG 

CGTGACGGGG CGCTTCTTCC 
10901 TCCGGCGGCG GGGCTCTAGA 

AGGCCGCCGC CCCGAGATCT 
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TGTAACAACA TGGTTCACCT GTCTTGGTCC 
AC ATTGTTGT ACCAAGTGGA CAGAACCAGG 
TAACCCCAGT ATCATATTGC ACTAGGATTA 
ATTGGGGTCA TAGTATAACG TGATCCTAAT 
ATTCGTGTGA GATGGACATC CAGTCTTTAC 
T AAGC AC AC T CTACCTGTAG GTCAGAAATG 
ATTTCTATTG TTAAAGATAT TCAGAATGTT 
TAAAGATAAC AATTTCTATA AGTCTTACAA 
ATTGCCCAAG GGGTTTGTGA GGGTTATATT 
TAACGGGTTC CCCAAACACT CCCAATATAA 
CCACTGAACC CCCCGTCCAA ATTTTATTCT 
GGTGACTTGG GGGGCAGGTT TAAAATAAGA 
TGTTTTCGAG CACCTCACAT ACACCTTACT 
ACAAAAGCTC GTGGAGTGTA TGTGGAATGA 
TCTATTAGCT AAACGAAGGA GAATGAAGAA 
AG AT AATC G A TTTGCTTCCT CTTACTTCTT 
GTTCACTGCC CGCTCCTTGA TCTTCAGCCA 
CAAGTGACGG GCGAGGAACT AGAAGTCGGT 
GTTC ACTAC C CTCGTGGAAT CCTGACCCCA 
CAAGTGATGG GAGCACCTTA GGACTGGGGT 
CTCATGGGGT GGGAGATATC GCTGTTCCTT 
GAGTACCCCA C C C TC TAT AG CGACAAGGAA 
AATT C GAT AG CATATGCTTC C CGTTGGGT A 
TTAAGCTATC GTATACGAAG GGCAACCCAT 
GTTAGTCTGG ATAGTATATA CTACTACCCG 
CAATCAGACC TATCATATAT GATGATGGGC 
TAGGGTTAAC AAGGGGGCCT TATAAACACT 
ATCCCAATTG TTCCCCCGGA ATATTTGTGA 
GGTCCGCTTA TCGGTAGCTA CACAGGCCCC 
CCAGGCGAAT AGCCATCGAT GTGTCCGGGG 
CCTCCCGTAG TCTTCCTGGG CCCCTGGGAG 
GGAGGGCATC AGAAGGACCC GGGGACCCTC 
GTGTAAGAGC TTCAGCCAAG AGTTACACAT 
CACATTCTCG AAGTCGGTTC TCAATGTGTA 
GTCCACAGAC TGCAAAGTCT GCTCCAGGAT 
CAGGTGTCTG AC GTTTC AG A CGAGGTCCTA 
AAATGTG C AC ATCCATTTAT AAGGATGTCA 
TTTACACGTG TAGGTAAATA TTCCTACAGT 
TTGTGTTTGG TCCCCCCCCG TGTCACATGT 
AACACAAACC AGGGGGGGGC AC AGTGT AC A 
AGTTGT AC C A ACCAACTGAA GGGATTACAT 
TCAACATGGT TGGTTGACTT CCCTAATGTA 
GGCAGAGATG CCGTAGTCAG GTTT AGTTC G 
CCGTCTCTAC GGCATCAGTC CAAATCAAGC 
GTCGACCGGT CATGGCTGCG CCCCGACACC 
CAGCTGGCCA GTACCGACGC GGGGCTGTGG 
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10951 CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC 

GCGGTTGTGG GCGACTGCGC GGGACTGCCC GAACAGACGA GGGCCGTAGG 
11001 GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT 

CGAATGTCTG TTCGACACTG GCAGAGGCCC TCGACGTACA CAGTCTCCAA 
11051 TTC AC CGTC A TCACCGAAAC GCGCGAGGCA GCCGGATCAT AATCAGCCAT 

AAGTGGCAGT AGTGGCTTTG CGCGCTCCGT CGGCCTAGTA TTAGTCGGTA 
11101 ACCACATTTG TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CCACCTCCCC 

TGGTGTAAAC ATCTCCAAAA TGAACGAAAT TTTTTGGAGG GGTGGAGGGG 
11151 CTGAACCTGA AAC AT AAAAT GAATGCAATT GTTGTTGTTA AC TTGTTTAT 

GACTTGGACT TTGTATTTTA CTTACGTTAA CAACAACAAT TGAACAAATA 
11201 TGCAGCTTAT AATGGTT AC A AATAAAGCAA TAGCATCACA AATTTCACAA 

ACGTCGAATA TTACCAATGT TTATTTCGTT ATCGTAGTGT TTAAAGTGTT 
112 51 ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 

TATTTCGTAA AAAAAGTGAC GTAAGATCAA CACCAAACAG GTTTGAGTAG 
11301 AATGTATCTT ATCATGTCTG GATCCCACGT GCAGGCGGGG AGGCGGCCCA 

TTACATAGAA T AGT AC AG AC CTAGGGTGCA CGTCCGCCCC TCCGCCGGGT 
11351 AAGGGAGATC CGACTCGTCT GAGGGCGAAG GCGAAGACGC GGAAGAGGCC 

TTCCCTCTAG GCTGAGCAGA CTCCCGCTTC CGCTTCTGCG CCTTCTCCGG 
11401 GCAGAGCCGG CAGCAGGCCG CGGGAAGGAA GGTCCGCTGG ATTGAGGGCC 

CGTCTCGGCC GTCGTCCGGC GCCCTTCCTT CCAGGCGACC TAACTCCCGG 
11451 GAAGGGACGT AGCAGAAGGA CGTCCCGCGC AGAATCCAGG TGGCAACACA 

CTTCCCTGCA TCGTCTTCCT GCAGGGCGCG TCTTAGGTCC ACCGTTGTGT 
11501 GGCGAGCAGC CAAGGAAAGG ACGATGATTT CCCCGACAAC ACCACGGAAT 

CCGCTCGTCG GTTCCTTTCC TGCTACTAAA GGGGC TGTTG TGGTGCCTTA 
11551 TGTCAGTGCC CAACAGCCGA GCCCCTGTCC AGCAGCGGGC AAGGCAGGCG 

ACAGTCACGG GTTGTCGGCT CGGGGACAGG TCGTCGCCCG TTCCGTCCGC 
11601 GCGATGAGTT CCGCCGTGGC AATAGGGAGG GGGAAAGCGA AAGTCCCGGA 

CGCTACTCAA GGCGGCACCG TTATCCCTCC CCCTTTCGCT TTCAGGGCCT 
11651 AAGGAGCTGA CAGGTGGTGG CAATGCCCCA AC C AGTGGGG GTTGCGTCAG 

TTCCTCGACT GTCCACCACC GTT AC GGGGT TGGTCACCCC CAACGCAGTC 
11701 CAAACACAGT GCACACCACG CCACGTTGCC TGACAACGGG CCACAACTCC 

GTTTGTGTCA CGTGTGGTGC GGTGCAACGG ACTGTTGCCC GGTGTTGAGG 
11751 TC AT AAAG AG ACAGCAACCA GGATTTATAC AAGGAGGAGA AAATGAAAGC 

AGTATTTCTC TGTCGTTGGT CCTAAATATG TTCCTCCTCT TTTACTTTCG 
11801 CATACGGGAA GCAATAGCAT GATACAAAGG CATTAAAGCA GCGTATCCAC 

GTATGCCCTT CGTTATCGTA CTATGTTTCC GTAATTTCGT CGCATAGGTG 
11851 ATAGCGTAAA AGGAGCAACA TAGTTAAGAA TACCAGTCAA TCTTTCACAA 

TATCGCATTT TCCTCGTTGT ATCAATTCTT ATGGTCAGTT AGAAAGTGTT 
119 01 ATTTTGTAAT CCAGAGGTTG ATTC 

TAAAACATTA GGTCTCCAAC TAAG 
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Figure 12: Western Blot of Hepsin expressed in 293EBNA cells. 4 days 
after transf ection, conditioned medium was harvested. 10 jal of 
conditioned medium was loaded on a 10% Tris-Glycine gel. Untreated 
293EBNA cells served as negative control, 5 ng of purified hepsin ED/EK 
(expressed from insect cells) served as positive control. After 
electrophoresis, proteins were transferred onto Nitrocellulose membrane 
at 100 V for lh. The membrane was then probed with anti-V5 antibody at 
37C° for lh after blocking with 5% dry milk in TTBS buffer for lh. 
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Figure 1 5A: Western Slot Blot on the hybridomas 1 1C1 & 47A5 are WB positive against 
Hepsin-ED-EK protein. 
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Figure 15B: Western Slot Blot on the hybridomas 31 C1 and 38E2 are WB positive against 
Hepsin-ED-EK protein. 
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Figure 15C: Western Slot Blot on the hybridomas 37G10, 46D12 & 14C7 are WB 
positive against Hepsin-ED-EK protein. 
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Figure 15D: Western Slot Blot on the hybridomas 14C7 & 72H6 are WB positive against 
Hepsin-ED-EK protein. 
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Human wild- type hepsin amino acid sequence: 

maqkeggrtv pccsrpkvaa ltagtllllt aigaaswaiv avllrsdqep lypvqvssad 61 
arlmvfdkte gtwrllcssr snarvaglsc eemgflralt hseldvrtag angtsgffcv 121 
degrlphtqr llevisvcdc prgrflaaic qdcgrrklpv drivggrdts lgrwpwqvsl 181 
rydgahlcgg sllsgdwvlt aahcfpernr vlsrwrvfag avaqasphgl qlgvqawyh 241 
ggylpfrdpn seensndial vhlssplplt eyiqpvclpa agqalvdgki ctvtgwgntq 301 
yygqqagvlq earvpiisnd vcngadfygn qikpkmfcag ypeggidacq gdsggpfvce 3 61 
dsisrtprwr lcgivswgtg calaqkpgvy tkvsdfrewi fqaikthsea sgmvtql 

the cytoplasmic domain: Metl to Lysl7 
the transmembrane domain: Vail 8 to Leu44 
the ectodomain: Arg45 to Leu417 



Figure 18 



Hep-ED-EK structure (modified soluble hepsin with substitute activation sequence) 

1 RSDQEPLYPV QVSSADARLM VFDKTEGTWR LLCSSRSNAR VAGLSCEEMG FLRALTHSEL 

61 DVRTAGANGT SGFFCVDEGR LPHTQRLLEV ISVCDCPRGR FLAAICQDCG RRKL P VDDDD 

121 KIVGGRDTSL GRWPWQVSLR YDGAHLCGGS LLSGDWVLTA AHCFPERNRV L S RWRVF AG A 

181 VAQASPHGLQ LGVQAWYHG GYLPFRDPNS EENSNDIALV HLSSPLPLTE YIQPVCLPAA 

241 GQALVDGKIC TVTGWGNTQY YGQQAGVLQE ARVPIISNDV CNGADFYGNQ IKPKMFCAGY 

3 01 PEGGIDACQG DSGGPFVCED SISRTPRWRL CGIVSWGTGC ALAQKPGVYT KVSDFREWIF 
3 61 QAIKTHSEAS GMVTQL EFGK PIPNPLLGLD STRTGHHHHH H* 



Cytoplasmic domain: absent 
Transmembrane domain: absent 

Modified activation domain : 117-121 (underlined) 
V5 and 6-His Tag: 377-401 



